Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants. (February 2021)
- Record Type:
- Journal Article
- Title:
- Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants. (February 2021)
- Main Title:
- Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants
- Authors:
- Santiso, Sara
Pérez, Alicia
Casillas, Arantza - Abstract:
- Highlights: Clinical text mining is applied to Adverse Drug Reaction (ADR) extraction. An ADR is a cause-effect relation between a drug and a disease. Stages: 1) recognize drugs and diseases; 2) from drug-disease pairs detect ADRs. Deep neural networks resulted robust against class imbalance and lexical variations. Assessed the tolerance of Joint AB-LSTM to noise introduced at entity recognition. Abstract: Background and Objective: This work tackles the Adverse Drug Reaction (ADR) extraction in Electronic Health Records (EHRs) written in Spanish. This task is within the framework of natural language processing. It consists of extracting relations between drug-disease pairs, with the drug as the causing agent of the reaction. To this end, a pipeline is employed: first, relevant clinical entities are recognized (e.g. drugs, active ingredients, findings, symptoms); next, drug-disease candidate pairs are judged as either ADR or non-ADR. To develop this task, it is necessary to tackle some challenges. First, EHRs show high lexical variability. Second, EHRs are scarce due to their sensitive information. Third, the ADR detection stage has to cope with errors derived from the entity recognition. Methods: To develop the ADR detection we decided to employ a deep neural network approach. In order to asses the tolerance to external variations, the system was exposed to different levels of noise. First, with three corpora that contain documents from different hospitals, size and classHighlights: Clinical text mining is applied to Adverse Drug Reaction (ADR) extraction. An ADR is a cause-effect relation between a drug and a disease. Stages: 1) recognize drugs and diseases; 2) from drug-disease pairs detect ADRs. Deep neural networks resulted robust against class imbalance and lexical variations. Assessed the tolerance of Joint AB-LSTM to noise introduced at entity recognition. Abstract: Background and Objective: This work tackles the Adverse Drug Reaction (ADR) extraction in Electronic Health Records (EHRs) written in Spanish. This task is within the framework of natural language processing. It consists of extracting relations between drug-disease pairs, with the drug as the causing agent of the reaction. To this end, a pipeline is employed: first, relevant clinical entities are recognized (e.g. drugs, active ingredients, findings, symptoms); next, drug-disease candidate pairs are judged as either ADR or non-ADR. To develop this task, it is necessary to tackle some challenges. First, EHRs show high lexical variability. Second, EHRs are scarce due to their sensitive information. Third, the ADR detection stage has to cope with errors derived from the entity recognition. Methods: To develop the ADR detection we decided to employ a deep neural network approach. In order to asses the tolerance to external variations, the system was exposed to different levels of noise. First, with three corpora that contain documents from different hospitals, size and class imbalance ratio. Furthermore, it was exposed to cross-corpus relation extraction. Second, we assessed the sensitivity of the ADR detection stage to noise introduced by the automatic Medical Entity Recognition (MER). Results: The system can cope with cross-hospital predictions provided that it was trained with a large corpus. In the most challenging situation an f-measure of 75.2 was achieved. With respect to the tolerance to errors derived from the entity recognition step, with a medical entity recognizer that missed 20 % of the entities, the f-measure in the ADR detection stage decreased to 68.6. Conclusions: The ADR extraction is tackled as a cause-effect relation extraction task between drugs and diseases. It is advisable to employ as many EHRs as possible in order to make more robust the ADR extraction. Despite the entities missed in the MER step, the drop in the performance is not high with the proposed system. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 199(2021)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 199(2021)
- Issue Display:
- Volume 199, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 199
- Issue:
- 2021
- Issue Sort Value:
- 2021-0199-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Natural language processing -- Adverse drug reaction extraction -- Electronic health records -- Deep learning
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2020.105891 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15634.xml