Applying interpretable deep learning models to identify chronic cough patients using EHR data. (October 2021)
- Record Type:
- Journal Article
- Title:
- Applying interpretable deep learning models to identify chronic cough patients using EHR data. (October 2021)
- Main Title:
- Applying interpretable deep learning models to identify chronic cough patients using EHR data
- Authors:
- Luo, Xiao
Gandhi, Priyanka
Zhang, Zuoyi
Shao, Wei
Han, Zhi
Chandrasekaran, Vasu
Turzhitsky, Vladimir
Bali, Vishal
Roberts, Anna R.
Metzger, Megan
Baker, Jarod
La Rosa, Carmen
Weaver, Jessica
Dexter, Paul
Huang, Kun - Abstract:
- Highlights: Evaluated language-based data representation with different learning models to predict chronic cough patients using EHR data. The best approach can gain 0.952 and 0.930 sensitivity and specificity, respectively. Identify chronic cough patients for research through medication and diagnosis data. Deep learning attention mechanisms extracted features that drive the prediction decisions. Abstract: Background and Objective: Chronic cough (CC) affects approximately 10% of adults. Many disease states are associated with chronic cough, such as asthma, upper airway cough syndrome, bronchitis, and gastroesophageal reflux disease. The lack of an ICD code specific for chronic cough makes it challenging to identify such patients from electronic health records (EHRs). For clinical and research purposes, computational methods using EHR data are urgently needed to identify chronic cough cases. This research aims to investigate the data representations and deep learning algorithms for chronic cough prediction. Methods: Utilizing real-world EHR data from a large academic healthcare system from October 2005 to September 2015, we investigated Natural Language Representation of the EHR data and systematically evaluated deep learning and traditional machine learning models to predict chronic cough patients. We built these machine learning models using structured data (medication and diagnosis) and unstructured data (clinical notes). Results: The sensitivity and specificity of aHighlights: Evaluated language-based data representation with different learning models to predict chronic cough patients using EHR data. The best approach can gain 0.952 and 0.930 sensitivity and specificity, respectively. Identify chronic cough patients for research through medication and diagnosis data. Deep learning attention mechanisms extracted features that drive the prediction decisions. Abstract: Background and Objective: Chronic cough (CC) affects approximately 10% of adults. Many disease states are associated with chronic cough, such as asthma, upper airway cough syndrome, bronchitis, and gastroesophageal reflux disease. The lack of an ICD code specific for chronic cough makes it challenging to identify such patients from electronic health records (EHRs). For clinical and research purposes, computational methods using EHR data are urgently needed to identify chronic cough cases. This research aims to investigate the data representations and deep learning algorithms for chronic cough prediction. Methods: Utilizing real-world EHR data from a large academic healthcare system from October 2005 to September 2015, we investigated Natural Language Representation of the EHR data and systematically evaluated deep learning and traditional machine learning models to predict chronic cough patients. We built these machine learning models using structured data (medication and diagnosis) and unstructured data (clinical notes). Results: The sensitivity and specificity of a transformer-based deep learning algorithm, specifically BERT with attention model, was 0.856 and 0.866, respectively, using structured data (medication and diagnosis). Sensitivity and specificity improved to 0.952 and 0.930 when we combined structured data with symptoms extracted from clinical notes. We further found that the attention mechanism of deep learning models can be used to extract important features that drive the prediction decisions. Compared with our previously published rule-based algorithm, the deep learning algorithm can identify more chronic cough patients with structured data. Conclusions: By applying deep learning models, chronic cough patients can be reliably identified for prospective or retrospective research through medication and diagnosis data, widely available in EHR and electronic claims data, thus improving the generalizability of the patient identification algorithm. Deep learning models can identify chronic cough patients with even higher sensitivity and specificity when structured and unstructured EHR data are utilized. We anticipate language-based data representation and deep learning models developed in this research could also be productively used for other disease prediction and case identification. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 210(2021)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 210(2021)
- Issue Display:
- Volume 210, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 210
- Issue:
- 2021
- Issue Sort Value:
- 2021-0210-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-10
- Subjects:
- Machine learning -- Deep learning -- Algorithms -- Chronic cough -- Electronic health records -- Nlp
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2021.106395 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19165.xml