Inferred joint multigram models for medical term normalization according to ICD. (February 2018)
- Record Type:
- Journal Article
- Title:
- Inferred joint multigram models for medical term normalization according to ICD. (February 2018)
- Main Title:
- Inferred joint multigram models for medical term normalization according to ICD
- Authors:
- Pérez, Alicia
Atutxa, Aitziber
Casillas, Arantza
Gojenola, Koldo
Sellart, Álvaro - Abstract:
- Graphical abstract: Highlights: Health records comprise valuable information written in a spontaneous register. Matching spontaneous terms in standard ICD terminology is challenging: 7.71% found. This work presents a system to aid human ICD coders find standard diagnostic terms. Terminology normalization is tackled based on weighted finite-state transducers. After the normalization, 94.91% accuracy was achieved. Abstract: Background: Electronic Health Records (EHRs) are written using spontaneous natural language. Often, terms do not match standard terminology like the one available through the International Classification of Diseases (ICD). Objective: Information retrieval and exchange can be improved using standard terminology. Our aim is to render diagnostic terms written in spontaneous language in EHRs into the standard framework provided by the ICD. Methods: We tackle diagnostic term normalization employing Weighted Finite-State Transducers (WFSTs). These machines learn how to translate sequences, in the case of our concern, spontaneous representations into standard representations given a set of samples. They are highly flexible and easily adaptable to terminological singularities of each different hospital and practitioner. Besides, we implemented a similarity metric to enhance spontaneous-standard term matching. Results: From the 2850 spontaneous DTs randomly selected we found that only 7.71% were written in their standard form matching the ICD. This WFST-based systemGraphical abstract: Highlights: Health records comprise valuable information written in a spontaneous register. Matching spontaneous terms in standard ICD terminology is challenging: 7.71% found. This work presents a system to aid human ICD coders find standard diagnostic terms. Terminology normalization is tackled based on weighted finite-state transducers. After the normalization, 94.91% accuracy was achieved. Abstract: Background: Electronic Health Records (EHRs) are written using spontaneous natural language. Often, terms do not match standard terminology like the one available through the International Classification of Diseases (ICD). Objective: Information retrieval and exchange can be improved using standard terminology. Our aim is to render diagnostic terms written in spontaneous language in EHRs into the standard framework provided by the ICD. Methods: We tackle diagnostic term normalization employing Weighted Finite-State Transducers (WFSTs). These machines learn how to translate sequences, in the case of our concern, spontaneous representations into standard representations given a set of samples. They are highly flexible and easily adaptable to terminological singularities of each different hospital and practitioner. Besides, we implemented a similarity metric to enhance spontaneous-standard term matching. Results: From the 2850 spontaneous DTs randomly selected we found that only 7.71% were written in their standard form matching the ICD. This WFST-based system enabled matching spontaneous ICDs with a Mean Reciprocal Rank of 0.68, which means that, on average, the right ICD code is found between the first and second position among the normalized set of candidates. This guarantees efficient document exchange and, furthermore, information retrieval. Conclusion: Medical term normalization was achieved with high performance. We found that direct matching of spontaneous terms using standard lexicons leads to unsatisfactory results while normalized hypothesis generation by means of WFST helped to overcome the gap between spontaneous and standard language. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 110(2018)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 110(2018)
- Issue Display:
- Volume 110, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 110
- Issue:
- 2018
- Issue Sort Value:
- 2018-0110-2018-0000
- Page Start:
- 111
- Page End:
- 117
- Publication Date:
- 2018-02
- Subjects:
- International Classification of Diseases -- Electronic Health Records -- Normalization -- Finite State Models
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2017.12.007 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5621.xml