A genetic algorithm enabled ensemble for unsupervised medical term extraction from clinical letters. Issue 1 (December 2015)
- Record Type:
- Journal Article
- Title:
- A genetic algorithm enabled ensemble for unsupervised medical term extraction from clinical letters. Issue 1 (December 2015)
- Main Title:
- A genetic algorithm enabled ensemble for unsupervised medical term extraction from clinical letters
- Authors:
- Liu, Wei
Chung, Bo
Wang, Rui
Ng, Jonathon
Morlet, Nigel - Abstract:
- Abstract Despite the rapid global movement towards electronic health records, clinical letters written in unstructured natural languages are still the preferred form of inter-practitioner communication about patients. These letters, when archived over a long period of time, provide invaluable longitudinal clinical details on individual and populations of patients. In this paper we present three unsupervised approaches, sequential pattern mining (PrefixSpan); frequency linguistic based C-Value ; and keyphrase extraction from co-occurrence graphs (TextRank), to automatically extract single and multi-word medical terms without domain-specific knowledge. Because each of the three approaches focuses on different aspects of the language feature space, we propose a genetic algorithm to learn the best parameters of linearly integrating the three extractors for optimal performance against domain expert annotations. Around 30, 000 clinical letters sent over the past decade from ophthalmology specialists to general practitioners at an eye clinic are anonymised as the corpus to evaluate the effectiveness of the ensemble against individual extractors. With minimal annotation, the ensemble achieves an average F-measure of 65.65 % when considering only complex medical terms, and a F-measure of 72.47 % if we take single word terms (i.e. unigrams) into consideration, markedly better than the three term extraction techniques when used alone.
- Is Part Of:
- Health information science and systems. Volume 3:Issue 1(2015)
- Journal:
- Health information science and systems
- Issue:
- Volume 3:Issue 1(2015)
- Issue Display:
- Volume 3, Issue 1 (2015)
- Year:
- 2015
- Volume:
- 3
- Issue:
- 1
- Issue Sort Value:
- 2015-0003-0001-0000
- Page Start:
- 1
- Page End:
- 14
- Publication Date:
- 2015-12
- Subjects:
- Clinical term extraction -- Sequence mining algorithms -- Genetic algorithm
Medical informatics -- Periodicals
Medicine -- Data processing -- Periodicals
Medical Informatics -- Periodicals
Medical informatics
Medicine -- Data processing
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://bibpurl.oclc.org/web/51362 ↗
http://www.hissjournal.com/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s13755-015-0013-y ↗
- Languages:
- English
- ISSNs:
- 2047-2501
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10199.xml