Customizable Natural Language Processing Biomarker Extraction Tool. (2021)
- Record Type:
- Journal Article
- Title:
- Customizable Natural Language Processing Biomarker Extraction Tool. (2021)
- Main Title:
- Customizable Natural Language Processing Biomarker Extraction Tool
- Authors:
- Holmes, Benjamin
Chitale, Dhananjay
Loving, Joshua
Tran, Mary
Subramanian, Vinod
Berry, Anna
Rioth, Matthew
Warrier, Raghu
Brown, Thomas - Abstract:
- Abstract : PURPOSE: Natural language processing (NLP) in pathology reports to extract biomarker information is an ongoing area of research. MetaMap is a natural language processing tool developed and funded by the National Library of Medicine to map biomedical text to the Unified Medical Language System Metathesaurus by applying specific tags to clinically relevant terms. Although results are useful without additional postprocessing, these tags lack important contextual information. METHODS: Our novel method takes terminology-driven semantic tags and incorporates those into a semantic frame that is task-specific to add necessary context to MetaMap. We use important contextual information to capture biomarker results to support Community Health System's use of Precision Medicine treatments for patients with cancer. For each biomarker, the name, type, numeric quantifiers, non-numeric qualifiers, and the time frame are extracted. These fields then associate biomarkers with their context in the pathology report such as test type, probe intensity, copy-number changes, and even failed results. A selection of 6, 713 relevant reports contained the following standard-of-care biomarkers for metastatic breast cancer: breast cancer gene 1 and 2, estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2, and programmed death-ligand 1. RESULTS: The method was tested on pathology reports from the internal pathology laboratory at Henry Ford Health System. AAbstract : PURPOSE: Natural language processing (NLP) in pathology reports to extract biomarker information is an ongoing area of research. MetaMap is a natural language processing tool developed and funded by the National Library of Medicine to map biomedical text to the Unified Medical Language System Metathesaurus by applying specific tags to clinically relevant terms. Although results are useful without additional postprocessing, these tags lack important contextual information. METHODS: Our novel method takes terminology-driven semantic tags and incorporates those into a semantic frame that is task-specific to add necessary context to MetaMap. We use important contextual information to capture biomarker results to support Community Health System's use of Precision Medicine treatments for patients with cancer. For each biomarker, the name, type, numeric quantifiers, non-numeric qualifiers, and the time frame are extracted. These fields then associate biomarkers with their context in the pathology report such as test type, probe intensity, copy-number changes, and even failed results. A selection of 6, 713 relevant reports contained the following standard-of-care biomarkers for metastatic breast cancer: breast cancer gene 1 and 2, estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2, and programmed death-ligand 1. RESULTS: The method was tested on pathology reports from the internal pathology laboratory at Henry Ford Health System. A certified tumor registrar reviewed 400 tests, which showed > 95% accuracy for all extracted biomarker types. CONCLUSION: Using this new method, it is possible to extract high-quality, contextual biomarker information, and this represents a significant advance in biomarker extraction. … (more)
- Is Part Of:
- JCO Clinical Cancer Informatics. Volume 5(2021)
- Journal:
- JCO Clinical Cancer Informatics
- Issue:
- Volume 5(2021)
- Issue Display:
- Volume 5, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 5
- Issue:
- 2021
- Issue Sort Value:
- 2021-0005-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021
- Subjects:
- 616.994
- Journal URLs:
- http://journals.lww.com/pages/default.aspx ↗
- DOI:
- 10.1200/CCI.21.00017 ↗
- Languages:
- English
- ISSNs:
- 2473-4276
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21253.xml