Identifying Symptom Information in Clinical Notes Using Natural Language Processing. Issue 3 (May 2021)
- Record Type:
- Journal Article
- Title:
- Identifying Symptom Information in Clinical Notes Using Natural Language Processing. Issue 3 (May 2021)
- Main Title:
- Identifying Symptom Information in Clinical Notes Using Natural Language Processing
- Authors:
- Koleck, Theresa A.
Tatonetti, Nicholas P.
Bakken, Suzanne
Mitha, Shazia
Henderson, Morgan M.
George, Maureen
Miaskowski, Christine
Smaldone, Arlene
Topaz, Maxim - Abstract:
- Abstract : Background: Symptoms are a core concept of nursing interest. Large-scale secondary data reuse of notes in electronic health records (EHRs) has the potential to increase the quantity and quality of symptom research. However, the symptom language used in clinical notes is complex. A need exists for methods designed specifically to identify and study symptom information from EHR notes. Objectives: We aim to describe a method that combines standardized vocabularies, clinical expertise, and natural language processing to generate comprehensive symptom vocabularies and identify symptom information in EHR notes. We piloted this method with five diverse symptom concepts: constipation, depressed mood, disturbed sleep, fatigue, and palpitations . Methods: First, we obtained synonym lists for each pilot symptom concept from the Unified Medical Language System. Then, we used two large bodies of text (clinical notes from Columbia University Irving Medical Center and PubMed abstracts containing Medical Subject Headings or key words related to the pilot symptoms) to further expand our initial vocabulary of synonyms for each pilot symptom concept. We used NimbleMiner, an open-source natural language processing tool, to accomplish these tasks and evaluated NimbleMiner symptom identification performance by comparison to a manually annotated set of nurse- and physician-authored common EHR note types. Results: Compared to the baseline Unified Medical Language System synonym lists, weAbstract : Background: Symptoms are a core concept of nursing interest. Large-scale secondary data reuse of notes in electronic health records (EHRs) has the potential to increase the quantity and quality of symptom research. However, the symptom language used in clinical notes is complex. A need exists for methods designed specifically to identify and study symptom information from EHR notes. Objectives: We aim to describe a method that combines standardized vocabularies, clinical expertise, and natural language processing to generate comprehensive symptom vocabularies and identify symptom information in EHR notes. We piloted this method with five diverse symptom concepts: constipation, depressed mood, disturbed sleep, fatigue, and palpitations . Methods: First, we obtained synonym lists for each pilot symptom concept from the Unified Medical Language System. Then, we used two large bodies of text (clinical notes from Columbia University Irving Medical Center and PubMed abstracts containing Medical Subject Headings or key words related to the pilot symptoms) to further expand our initial vocabulary of synonyms for each pilot symptom concept. We used NimbleMiner, an open-source natural language processing tool, to accomplish these tasks and evaluated NimbleMiner symptom identification performance by comparison to a manually annotated set of nurse- and physician-authored common EHR note types. Results: Compared to the baseline Unified Medical Language System synonym lists, we identified up to 11 times more additional synonym words or expressions, including abbreviations, misspellings, and unique multiword combinations, for each symptom concept. Natural language processing system symptom identification performance was excellent. Discussion: Using our comprehensive symptom vocabularies and NimbleMiner to label symptoms in clinical notes produced excellent performance metrics. The ability to extract symptom information from EHR notes in an accurate and scalable manner has the potential to greatly facilitate symptom science research. Abstract : Supplemental digital content is available in the text. … (more)
- Is Part Of:
- Nursing research. Volume 70:Issue 3(2021)
- Journal:
- Nursing research
- Issue:
- Volume 70:Issue 3(2021)
- Issue Display:
- Volume 70, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 70
- Issue:
- 3
- Issue Sort Value:
- 2021-0070-0003-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-05
- Subjects:
- electronic health records -- natural language processing -- signs and symptoms
Nursing -- Research -- Periodicals
Nursing -- Periodicals
Nursing -- Periodicals
Soins infirmiers -- Recherche -- Périodiques
Soins infirmiers -- Périodiques
Verpleegkunde
Nursing
Nursing -- Research
Periodicals
610.73 - Journal URLs:
- http://books.google.com/books?id=84oaAQAAMAAJ ↗
http://books.google.com/books?id=XKdRAQAAIAAJ ↗
http://books.google.com/books?id=1adRAQAAIAAJ ↗
http://catalog.hathitrust.org/api/volumes/oclc/1760937.html ↗
http://136.142.56.160/ovidweb/ovidweb.cgi?T=JS&MODE=ovid&NEWS=N&PAGE=toc&D=ovid_ovft&AN=00006199-000000000-00000 ↗
http://www.nursingresearchonline.com ↗
http://gateway.ovid.com/ovidweb.cgi?T=JS&PAGE=toc&D=ovft&MODE=ovid&NEWS=N&AN=00002060-000000000-00000 ↗
http://journals.lww.com/nursingresearchonline/pages/default.aspx ↗
http://journals.lww.com ↗ - DOI:
- 10.1097/NNR.0000000000000488 ↗
- Languages:
- English
- ISSNs:
- 0029-6562
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6187.110000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25582.xml