Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing. (22nd February 2022)

Record Type:: Journal Article
Title:: Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing. (22nd February 2022)
Main Title:: Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
Authors:: Xie, Kevin
Gallagher, Ryan S
Conrad, Erin C
Garrick, Chadric O
Baldassano, Steven N
Bernabei, John M
Galer, Peter D
Ghosn, Nina J
Greenblatt, Adam S
Jennings, Tara
Kornspun, Alana
Kulick-Soper, Catherine V
Panchal, Jal M
Pattnaik, Akash R
Scheid, Brittany H
Wei, Danmeng
Weitzman, Micah
Muthukrishnan, Ramya
Kim, Joongwon
Litt, Brian
Ellis, Colin A
Roth, Dan
Abstract:: Objective: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. Materials and Methods: We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. Results: The finetuned models (BERTFT, Bio_ClinicalBERTFT, and RoBERTaFT) achieved near-human performance when classifying patients as seizure-free, with BERTFT and Bio_ClinicalBERTFT achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F1 scores over 0.80. The best combination of models was Bio_ClinicalBERTFT for classification, and RoBERTaFT for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. Discussion and Conclusion: Our novel machine reading approach to extracting important clinical outcomes …
Is Part Of:: Journal of the American Medical Informatics Association. Volume 29:Number 5(2022)
Journal:: Journal of the American Medical Informatics Association
Issue:: Volume 29:Number 5(2022)
Issue Display:: Volume 29, Issue 5 (2022)
Year:: 2022
Volume:: 29
Issue:: 5
Issue Sort Value:: 2022-0029-0005-0000
Page Start:: 873
Page End:: 881
Publication Date:: 2022-02-22
Subjects:: electronic medical record -- natural language processing -- epilepsy -- question-answering
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285
Journal URLs:: http://jamia.bmj.com/
http://www.jamia.org
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76
http://www.sciencedirect.com/science/journal/10675027
http://jamia.oxfordjournals.org/
http://www.oxfordjournals.org/en/
DOI:: 10.1093/jamia/ocac018
Languages:: English
ISSNs:: 1067-5027
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 21290.xml