Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine. (5th February 2015)

Record Type:: Journal Article
Title:: Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine. (5th February 2015)
Main Title:: Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine
Authors:: Cohen, Aaron M
Smalheiser, Neil R
McDonagh, Marian S
Yu, Clement
Adams, Clive E
Davis, John M
Yu, Philip S
Abstract:: Objective: For many literature review tasks, including systematic review (SR) and other aspects of evidence-based medicine, it is important to know whether an article describes a randomized controlled trial (RCT). Current manual annotation is not complete or flexible enough for the SR process. In this work, highly accurate machine learning predictive models were built that include confidence predictions of whether an article is an RCT. Materials and Methods: The LibSVM classifier was used with forward selection of potential feature sets on a large human-related subset of MEDLINE to create a classification model requiring only the citation, abstract, and MeSH terms for each article. Results: The model achieved an area under the receiver operating characteristic curve of 0.973 and mean squared error of 0.013 on the held-out year 2011 data. Accurate confidence estimates were confirmed on a manually reviewed set of test articles. A second model not requiring MeSH terms was also created, and performs almost as well. Discussion: Both models accurately rank and predict article RCT confidence. Using the model and the manually reviewed samples, it is estimated that about 8000 (3%) additional RCTs can be identified in MEDLINE, and that 5% of articles tagged as RCTs in MEDLINE may not be identified. Conclusion: Retagging human-related studies with a continuously valued RCT confidence is potentially more useful for article ranking and review than a simple yes/no prediction. …
Is Part Of:: Journal of the American Medical Informatics Association. Volume 22:Number 3(2015:May)
Journal:: Journal of the American Medical Informatics Association
Issue:: Volume 22:Number 3(2015:May)
Issue Display:: Volume 22, Issue 3 (2015)
Year:: 2015
Volume:: 22
Issue:: 3
Issue Sort Value:: 2015-0022-0003-0000
Page Start:: 707
Page End:: 717
Publication Date:: 2015-02-05
Subjects:: Support Vector Machines -- Natural Language Processing -- Randomized Controlled Trials as Topic -- Evidence-Based Medicine -- Systematic Reviews -- Information Retrieval
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285
Journal URLs:: http://jamia.bmj.com/
http://www.jamia.org
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76
http://www.sciencedirect.com/science/journal/10675027
http://jamia.oxfordjournals.org/
http://www.oxfordjournals.org/en/
DOI:: 10.1093/jamia/ocu025
Languages:: English
ISSNs:: 1067-5027
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 15138.xml