Multi‐Center Colonoscopy Quality Measurement Utilizing Natural Language Processing. (April 2015)
- Record Type:
- Journal Article
- Title:
- Multi‐Center Colonoscopy Quality Measurement Utilizing Natural Language Processing. (April 2015)
- Main Title:
- Multi‐Center Colonoscopy Quality Measurement Utilizing Natural Language Processing
- Authors:
- Imler, Timothy D
Morea, Justin
Kahi, Charles
Cardwell, Jon
Johnson, Cynthia S
Xu, Huiping
Ahnen, Dennis
Antaki, Fadi
Ashley, Christopher
Baffy, Gyorgy
Cho, Ilseung
Dominitz, Jason
Hou, Jason
Korsten, Mark
Nagar, Anil
Promrat, Kittichai
Robertson, Douglas
Saini, Sameer
Shergill, Amandeep
Smalley, Walter
Imperiale, Thomas F - Abstract:
- Abstract : Background: An accurate system for tracking of colonoscopy quality and surveillance intervals could improve the effectiveness and cost‐effectiveness of colorectal cancer (CRC) screening and surveillance. The purpose of this study was to create and test such a system across multiple institutions utilizing natural language processing (NLP). Methods: From 42, 569 colonoscopies with pathology records from 13 centers, we randomly sampled 750 paired reports. We trained ( n =250) and tested ( n =500) an NLP‐based program with 19 measurements that encompass colonoscopy quality measures and surveillance interval determination, using blinded, paired, annotated expert manual review as the reference standard. The remaining 41, 819 nonannotated documents were processed through the NLP system without manual review to assess performance consistency. The primary outcome was system accuracy across the 19 measures. Results: A total of 176 (23.5%) documents with 252 (1.8%) discrepant content points resulted from paired annotation. Error rate within the 500 test documents was 31.2% for NLP and 25.4% for the paired annotators ( P =0.001). At the content point level within the test set, the error rate was 3.5% for NLP and 1.9% for the paired annotators ( P =0.04). When eight vaguely worded documents were removed, 125 of 492 (25.4%) were incorrect by NLP and 104 of 492 (21.1%) by the initial annotator ( P =0.07). Rates of pathologic findings calculated from NLP were similar to thoseAbstract : Background: An accurate system for tracking of colonoscopy quality and surveillance intervals could improve the effectiveness and cost‐effectiveness of colorectal cancer (CRC) screening and surveillance. The purpose of this study was to create and test such a system across multiple institutions utilizing natural language processing (NLP). Methods: From 42, 569 colonoscopies with pathology records from 13 centers, we randomly sampled 750 paired reports. We trained ( n =250) and tested ( n =500) an NLP‐based program with 19 measurements that encompass colonoscopy quality measures and surveillance interval determination, using blinded, paired, annotated expert manual review as the reference standard. The remaining 41, 819 nonannotated documents were processed through the NLP system without manual review to assess performance consistency. The primary outcome was system accuracy across the 19 measures. Results: A total of 176 (23.5%) documents with 252 (1.8%) discrepant content points resulted from paired annotation. Error rate within the 500 test documents was 31.2% for NLP and 25.4% for the paired annotators ( P =0.001). At the content point level within the test set, the error rate was 3.5% for NLP and 1.9% for the paired annotators ( P =0.04). When eight vaguely worded documents were removed, 125 of 492 (25.4%) were incorrect by NLP and 104 of 492 (21.1%) by the initial annotator ( P =0.07). Rates of pathologic findings calculated from NLP were similar to those calculated by annotation for the majority of measurements. Test set accuracy was 99.6% for CRC, 95% for advanced adenoma, 94.6% for nonadvanced adenoma, 99.8% for advanced sessile serrated polyps, 99.2% for nonadvanced sessile serrated polyps, 96.8% for large hyperplastic polyps, and 96.0% for small hyperplastic polyps. Lesion location showed high accuracy (87.0–99.8%). Accuracy for number of adenomas was 92%. Conclusions: NLP can accurately report adenoma detection rate and the components for determining guideline‐adherent colonoscopy surveillance intervals across multiple sites that utilize different methods for reporting colonoscopy findings. … (more)
- Is Part Of:
- American journal of gastroenterology. Volume 110:Number 4(2015)
- Journal:
- American journal of gastroenterology
- Issue:
- Volume 110:Number 4(2015)
- Issue Display:
- Volume 110, Issue 4 (2015)
- Year:
- 2015
- Volume:
- 110
- Issue:
- 4
- Issue Sort Value:
- 2015-0110-0004-0000
- Page Start:
- Page End:
- Publication Date:
- 2015-04
- Subjects:
- Stomach -- Diseases -- Periodicals
Intestines -- Diseases -- Periodicals
Gastroenterology -- Periodicals
Gastrointestinal Diseases -- Periodicals
Electronic journals
Periodicals
616.33 - Journal URLs:
- http://www.mdconsult.com/public/search?search_type=journal&j_sort=pub_date&j_date_range=1995-current&j_issn=0002-9270 ↗
http://www.amjgastro.com/ ↗
http://www.nature.com/ajg/archive/index.html ↗
http://www.sciencedirect.com/science/journal/00029270 ↗
http://www.nature.com/ ↗
http://www3.interscience.wiley.com/journal/117955841/home ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0002-9270;screen=info;ECOIP ↗ - DOI:
- 10.1038/ajg.2015.51 ↗
- Languages:
- English
- ISSNs:
- 0002-9270
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0824.650000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15944.xml