Text segmentation for patent claim simplification via Bidirectional Long‐Short Term Memory and Conditional Random Field. (14th May 2021)
- Record Type:
- Journal Article
- Title:
- Text segmentation for patent claim simplification via Bidirectional Long‐Short Term Memory and Conditional Random Field. (14th May 2021)
- Main Title:
- Text segmentation for patent claim simplification via Bidirectional Long‐Short Term Memory and Conditional Random Field
- Authors:
- Geng, Boting
- Other Names:
- Lv Zhihan guestEditor.
Lloret Jaime guestEditor.
Song Houbing guestEditor. - Abstract:
- Abstract: Text simplification is a vital work for comprehending patent claims due to its complex syntactic structures and lengthy sentences. Therefore, almost all patent analysis practitioners cannot be able to directly and intuitively understand patent essence even through some common natural language processing (NLP) tools are applied to parse these patent claim paragraph or sentences. Universal text analysis tools above is almost useless, or even crashed when applied to some complex paragraphs of patent claims. Therefore, it is necessary to propose a patent text oriented simplification approach to help patent researchers grasp the essence of patent quickly and intuitively. Motivated by the above reason, we in this article propose a simplification method based on deep learning to segment patent claim into shorter and comprehensible sentences for downstream tasks of patent analysis. The proposed approach contains two stages: on one stage, we use a machine learning approach of conditional random field (CRF) to decompose syntactically complex paragraphs into coarse‐grained level sentences with simplified structures and complete semantics; on another stage, a deep Learning architecture of bidirectional long‐short term memory (Bi‐LSTM)‐CRF is applied to segment coarse‐grained and lengthy sentences of former stage into fined‐grained and shorter sentences. Compared with a series of baselines, our patent segmentation architecture based on deep learning of Bi‐LSTM‐CRF achievesAbstract: Text simplification is a vital work for comprehending patent claims due to its complex syntactic structures and lengthy sentences. Therefore, almost all patent analysis practitioners cannot be able to directly and intuitively understand patent essence even through some common natural language processing (NLP) tools are applied to parse these patent claim paragraph or sentences. Universal text analysis tools above is almost useless, or even crashed when applied to some complex paragraphs of patent claims. Therefore, it is necessary to propose a patent text oriented simplification approach to help patent researchers grasp the essence of patent quickly and intuitively. Motivated by the above reason, we in this article propose a simplification method based on deep learning to segment patent claim into shorter and comprehensible sentences for downstream tasks of patent analysis. The proposed approach contains two stages: on one stage, we use a machine learning approach of conditional random field (CRF) to decompose syntactically complex paragraphs into coarse‐grained level sentences with simplified structures and complete semantics; on another stage, a deep Learning architecture of bidirectional long‐short term memory (Bi‐LSTM)‐CRF is applied to segment coarse‐grained and lengthy sentences of former stage into fined‐grained and shorter sentences. Compared with a series of baselines, our patent segmentation architecture based on deep learning of Bi‐LSTM‐CRF achieves higher performance than any other methods on the evaluation measures of precision, recall, and F1. … (more)
- Is Part Of:
- Computational intelligence. Volume 38:Number 1(2022)
- Journal:
- Computational intelligence
- Issue:
- Volume 38:Number 1(2022)
- Issue Display:
- Volume 38, Issue 1 (2022)
- Year:
- 2022
- Volume:
- 38
- Issue:
- 1
- Issue Sort Value:
- 2022-0038-0001-0000
- Page Start:
- 205
- Page End:
- 215
- Publication Date:
- 2021-05-14
- Subjects:
- Bi‐LSTM -- CRF -- patent claim -- sentence segmentation
Artificial intelligence -- Periodicals
Computational linguistics -- Periodicals
006.3 - Journal URLs:
- http://www.blackwellpublishing.com/journal.asp?ref=0824-7935&site=1 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/coin.12455 ↗
- Languages:
- English
- ISSNs:
- 0824-7935
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.595000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21104.xml