Computer aided classification of diagnostic terms in spanish. Issue 6 (15th April 2015)
- Record Type:
- Journal Article
- Title:
- Computer aided classification of diagnostic terms in spanish. Issue 6 (15th April 2015)
- Main Title:
- Computer aided classification of diagnostic terms in spanish
- Authors:
- Pérez, Alicia
Gojenola, Koldo
Casillas, Arantza
Oronoz, Maite
Díaz de Ilarraza, Arantza - Abstract:
- Highlights: Framework to classify Medical Records by their diagnostic terms. Resources: ICD catalogue, SNOMED ontology, Medical Records and synonym dictionaries. Finite-State Transducers efficiently implement soft-matching operations. An F1-measure of 91.2 was achieved on a test-set of 2850 diagnostic terms. Abstract: The goal of this paper is to classify Medical Records (MRs) by their diagnostic terms (DTs) according to the International Classification of Diseases Clinical Modification (ICD-9-CM). The challenge we face is twofold: (i) to treat the natural and non-standard language in which doctors express their diagnostics and (ii) to perform a large-scale classification problem. We propose the use of Finite-State Transducers (FSTs) that, for their underlying topology, constrain the allowed input DT string while synchronously produce the output ICD-9-CM class. It is outstanding their versatility to efficiently implement soft-matching operations between terms expressed in natural language to standard terms and, hence, to the final ICD-9-CM code. The FSTs were built up from a corpora and standard resources such as the ICD-9-CM and SNOMED CT amongst others. Our corpora count on a big-data comprising more than 20, 000 DTs from MRs from the Basque Hospital System so as to model natural language in this domain. An F1-measure of 91.2 was achieved on a test-set of 2850 randomly selected DTs, and a random 5-fold cross validation on a training set served to double-check the stabilityHighlights: Framework to classify Medical Records by their diagnostic terms. Resources: ICD catalogue, SNOMED ontology, Medical Records and synonym dictionaries. Finite-State Transducers efficiently implement soft-matching operations. An F1-measure of 91.2 was achieved on a test-set of 2850 diagnostic terms. Abstract: The goal of this paper is to classify Medical Records (MRs) by their diagnostic terms (DTs) according to the International Classification of Diseases Clinical Modification (ICD-9-CM). The challenge we face is twofold: (i) to treat the natural and non-standard language in which doctors express their diagnostics and (ii) to perform a large-scale classification problem. We propose the use of Finite-State Transducers (FSTs) that, for their underlying topology, constrain the allowed input DT string while synchronously produce the output ICD-9-CM class. It is outstanding their versatility to efficiently implement soft-matching operations between terms expressed in natural language to standard terms and, hence, to the final ICD-9-CM code. The FSTs were built up from a corpora and standard resources such as the ICD-9-CM and SNOMED CT amongst others. Our corpora count on a big-data comprising more than 20, 000 DTs from MRs from the Basque Hospital System so as to model natural language in this domain. An F1-measure of 91.2 was achieved on a test-set of 2850 randomly selected DTs, and a random 5-fold cross validation on a training set served to double-check the stability of the provided results. Real MRs were of much help to adapt the system to natural language. Misspellings, colloquial and specific language and abbreviations made the classification process difficult. The FSTs were proven efficient in this large-scale classification task. Moreover, the composition operation for FSTs made it easy the addition of new features to the system. … (more)
- Is Part Of:
- Expert systems with applications. Volume 42:Issue 6(2015)
- Journal:
- Expert systems with applications
- Issue:
- Volume 42:Issue 6(2015)
- Issue Display:
- Volume 42, Issue 6 (2015)
- Year:
- 2015
- Volume:
- 42
- Issue:
- 6
- Issue Sort Value:
- 2015-0042-0006-0000
- Page Start:
- 2949
- Page End:
- 2958
- Publication Date:
- 2015-04-15
- Subjects:
- Classification of Medical Records -- Natural language processing -- Finite-State Transducers -- Applications in medicine
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2014.11.035 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4918.xml