Automatic extraction and assessment of lifestyle exposures for Alzheimer's disease using natural language processing. (October 2019)
- Record Type:
- Journal Article
- Title:
- Automatic extraction and assessment of lifestyle exposures for Alzheimer's disease using natural language processing. (October 2019)
- Main Title:
- Automatic extraction and assessment of lifestyle exposures for Alzheimer's disease using natural language processing
- Authors:
- Zhou, Xin
Wang, Yanshan
Sohn, Sunghwan
Therneau, Terry M.
Liu, Hongfang
Knopman, David S. - Abstract:
- Highlights: Automatic extraction and assessment of lifestyle exposures using free-text EHRs for Alzheimer's disease. Feasibility and accuracy of investigating lifestye risk factors using natural language processing techniques. Patients with Alzheimer's disease might be exposed to more life style risk factors than the controls. Abstract: Introduction: Previous biomedical studies identified many lifestyle exposures that could possibly represent risk factors for dementia in general or dementia due to Alzheimer's disease (AD). These lifestyle exposures are mainly mentioned in free-text electronic health records (EHRs). However, automatic extraction and assessment of these exposures using EHRs remains understudied. Methods: A natural language processing (NLP) approach was adopted to extract lifestyle exposures and intervention strategies from the clinical notes of 260 patients with clinical diagnoses of AD dementia and 260 age-matched cognitively unimpaired persons. Statistics of lifestyle exposures were compared between these two groups. The mapping results of the NLP extraction were evaluated by comparing the results with data captured independently by clinicians. Results: Thirty out of fifty-five potentially relevant lifestyle exposures were mentioned in our clinical note dataset. Twenty-two dietary factors and three substance abuses that were potentially relevant were not found in clinical notes. Patients with AD dementia were significantly exposed to more of the potentialHighlights: Automatic extraction and assessment of lifestyle exposures using free-text EHRs for Alzheimer's disease. Feasibility and accuracy of investigating lifestye risk factors using natural language processing techniques. Patients with Alzheimer's disease might be exposed to more life style risk factors than the controls. Abstract: Introduction: Previous biomedical studies identified many lifestyle exposures that could possibly represent risk factors for dementia in general or dementia due to Alzheimer's disease (AD). These lifestyle exposures are mainly mentioned in free-text electronic health records (EHRs). However, automatic extraction and assessment of these exposures using EHRs remains understudied. Methods: A natural language processing (NLP) approach was adopted to extract lifestyle exposures and intervention strategies from the clinical notes of 260 patients with clinical diagnoses of AD dementia and 260 age-matched cognitively unimpaired persons. Statistics of lifestyle exposures were compared between these two groups. The mapping results of the NLP extraction were evaluated by comparing the results with data captured independently by clinicians. Results: Thirty out of fifty-five potentially relevant lifestyle exposures were mentioned in our clinical note dataset. Twenty-two dietary factors and three substance abuses that were potentially relevant were not found in clinical notes. Patients with AD dementia were significantly exposed to more of the potential risk factors compared to the cognitively unimpaired subjects (χ2 = 120.31, p-value < 0.001). The average accuracy of the automated extraction was 74.0% in comparison with the manual review of randomly selected 50 sample documents. Discussion and conclusion: We illustrated the feasibility of NLP techniques for the automated evaluation of a large number lifestyle habits using free-text EHR data. We found that AD dementia patients were exposed to more of the potential risk factors than the comparison group. Our results also demonstrated the feasibility and accuracy of investigating putative risk factors using NLP techniques. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 130(2019)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 130(2019)
- Issue Display:
- Volume 130, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 130
- Issue:
- 2019
- Issue Sort Value:
- 2019-0130-2019-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-10
- Subjects:
- Alzheimer's disease -- Electronic health records -- Natural language processing -- Lifestyle exposure
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2019.08.003 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11652.xml