Improving Methods of Identifying Anaphylaxis for Medical Product Safety Surveillance Using Natural Language Processing and Machine Learning. Issue 2 (4th November 2022)
- Record Type:
- Journal Article
- Title:
- Improving Methods of Identifying Anaphylaxis for Medical Product Safety Surveillance Using Natural Language Processing and Machine Learning. Issue 2 (4th November 2022)
- Main Title:
- Improving Methods of Identifying Anaphylaxis for Medical Product Safety Surveillance Using Natural Language Processing and Machine Learning
- Authors:
- Carrell, David S
Gruber, Susan
Floyd, James S
Bann, Maralyssa A
Cushing-Haugen, Kara L
Johnson, Ron L
Graham, Vina
Cronkite, David J
Hazlehurst, Brian L
Felcher, Andrew H
Bejan, Cosmin A
Kennedy, Adee
Shinde, Mayura U
Karami, Sara
Ma, Yong
Stojanovic, Danijela
Zhao, Yueqin
Ball, Robert
Nelson, Jennifer C - Abstract:
- Abstract: We sought to determine whether machine learning and natural language processing (NLP) applied to electronic medical records could improve performance of automated health-care claims-based algorithms to identify anaphylaxis events using data on 516 patients with outpatient, emergency department, or inpatient anaphylaxis diagnosis codes during 2015–2019 in 2 integrated health-care institutions in the Northwest United States. We used one site's manually reviewed gold-standard outcomes data for model development and the other's for external validation based on cross-validated area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), and sensitivity. In the development site 154 (64%) of 239 potential events met adjudication criteria for anaphylaxis compared with 180 (65%) of 277 in the validation site. Logistic regression models using only structured claims data achieved a cross-validated AUC of 0.58 (95% CI: 0.54, 0.63). Machine learning improved cross-validated AUC to 0.62 (0.58, 0.66); incorporating NLP-derived covariates further increased cross-validated AUCs to 0.70 (0.66, 0.75) in development and 0.67 (0.63, 0.71) in external validation data. A classification threshold with cross-validated PPV of 79% and cross-validated sensitivity of 66% in development data had cross-validated PPV of 78% and cross-validated sensitivity of 56% in external data. Machine learning and NLP-derived data improved identification of validatedAbstract: We sought to determine whether machine learning and natural language processing (NLP) applied to electronic medical records could improve performance of automated health-care claims-based algorithms to identify anaphylaxis events using data on 516 patients with outpatient, emergency department, or inpatient anaphylaxis diagnosis codes during 2015–2019 in 2 integrated health-care institutions in the Northwest United States. We used one site's manually reviewed gold-standard outcomes data for model development and the other's for external validation based on cross-validated area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), and sensitivity. In the development site 154 (64%) of 239 potential events met adjudication criteria for anaphylaxis compared with 180 (65%) of 277 in the validation site. Logistic regression models using only structured claims data achieved a cross-validated AUC of 0.58 (95% CI: 0.54, 0.63). Machine learning improved cross-validated AUC to 0.62 (0.58, 0.66); incorporating NLP-derived covariates further increased cross-validated AUCs to 0.70 (0.66, 0.75) in development and 0.67 (0.63, 0.71) in external validation data. A classification threshold with cross-validated PPV of 79% and cross-validated sensitivity of 66% in development data had cross-validated PPV of 78% and cross-validated sensitivity of 56% in external data. Machine learning and NLP-derived data improved identification of validated anaphylaxis events. … (more)
- Is Part Of:
- American journal of epidemiology. Volume 192:Issue 2(2023)
- Journal:
- American journal of epidemiology
- Issue:
- Volume 192:Issue 2(2023)
- Issue Display:
- Volume 192, Issue 2 (2023)
- Year:
- 2023
- Volume:
- 192
- Issue:
- 2
- Issue Sort Value:
- 2023-0192-0002-0000
- Page Start:
- 283
- Page End:
- 295
- Publication Date:
- 2022-11-04
- Subjects:
- anaphylaxis -- electronic health records -- health outcome identification -- machine learning, supervised -- postmarketing product surveillance -- predictive modeling
Epidemiology -- Periodicals
Public health -- Periodicals
614.4 - Journal URLs:
- http://aje.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/aje/kwac182 ↗
- Languages:
- English
- ISSNs:
- 0002-9262
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0824.600000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25695.xml