Using natural language processing to extract clinically useful information from Chinese electronic medical records. (April 2019)
- Record Type:
- Journal Article
- Title:
- Using natural language processing to extract clinically useful information from Chinese electronic medical records. (April 2019)
- Main Title:
- Using natural language processing to extract clinically useful information from Chinese electronic medical records
- Authors:
- Chen, Liang
Song, Liting
Shao, Yue
Li, Dewei
Ding, Keyue - Abstract:
- Highlights: Application of natural language processing (NLP) in Chinese electronic medical records (EMRs). Development of the rule-based and hybrid methods. Comparison of the performance of the rule-based and hybrid methods for processing unstructured data. Using the extracted information for the assessment of hepatocellular carcinoma staging. Abstract: Aims: To develop a natural language processing (NLP)-based algorithm for extracting clinically useful information for patients with hepatocellular carcinoma (HCC) from Chinese electronic medical records (EMRs) and use these data for the assessment of HCC staging. Materials and Methods: Clinical documents, including operation notes, radiology and pathology reports, of 92 HCC patients were collected from Chinese EMRs. We randomly grouped these patients into training ( n = 60) and testing ( n = 32) datasets. Rule-based and hybrid methods for extracting information were developed using the training set of manually-annotated operation notes. The method with better performance was used to process other documents. The performance of the algorithm was assessed via calculating the precision, recall and F -score for exact-boundary and partial-boundary matching strategies. The utility of clinically useful information for the HCC staging was assessed in comparison with that manually reviewed. Results: For operation notes, the rule-based and hybrid methods had a precision, recall and F- score ≥ 80% when the exact-boundary andHighlights: Application of natural language processing (NLP) in Chinese electronic medical records (EMRs). Development of the rule-based and hybrid methods. Comparison of the performance of the rule-based and hybrid methods for processing unstructured data. Using the extracted information for the assessment of hepatocellular carcinoma staging. Abstract: Aims: To develop a natural language processing (NLP)-based algorithm for extracting clinically useful information for patients with hepatocellular carcinoma (HCC) from Chinese electronic medical records (EMRs) and use these data for the assessment of HCC staging. Materials and Methods: Clinical documents, including operation notes, radiology and pathology reports, of 92 HCC patients were collected from Chinese EMRs. We randomly grouped these patients into training ( n = 60) and testing ( n = 32) datasets. Rule-based and hybrid methods for extracting information were developed using the training set of manually-annotated operation notes. The method with better performance was used to process other documents. The performance of the algorithm was assessed via calculating the precision, recall and F -score for exact-boundary and partial-boundary matching strategies. The utility of clinically useful information for the HCC staging was assessed in comparison with that manually reviewed. Results: For operation notes, the rule-based and hybrid methods had a precision, recall and F- score ≥ 80% when the exact-boundary and partial-boundary matching strategies were applied to the testing dataset. By using the rule-based method (which has better performance than the hybrid method), three other types of documents also obtained good performance. When the extracted clinically useful information was applied for the HCC staging, the concordance rate with the manual review was 75%. Conclusion: A NLP system was developed for clinical information extraction and HCC staging based on EMRs, and the results indicate that Chinese NLP has potential utility in clinical research. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 124(2019)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 124(2019)
- Issue Display:
- Volume 124, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 124
- Issue:
- 2019
- Issue Sort Value:
- 2019-0124-2019-0000
- Page Start:
- 6
- Page End:
- 12
- Publication Date:
- 2019-04
- Subjects:
- Chinese EMRs -- Cancer of liver Italian p (CLIP) -- Regular expression -- Rule-based method -- Hybrid method
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2019.01.004 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25122.xml