Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports. Issue 3 (13th September 2022)
- Record Type:
- Journal Article
- Title:
- Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports. Issue 3 (13th September 2022)
- Main Title:
- Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports
- Authors:
- De Angeli, Kevin
Gao, Shang
Blanchard, Andrew
Durbin, Eric B
Wu, Xiao-Cheng
Stroup, Antoinette
Doherty, Jennifer
Schwartz, Stephen M
Wiggins, Charles
Coyle, Linda
Penberthy, Lynne
Tourassi, Georgia
Yoon, Hong-Jun
- Abstract:
- Lay Summary: One of the goals of the Surveillance, Epidemiology, and End Results (SEER) program is to estimate the incidence, prevalence, and mortality of all cancers. To that end, cancer registries across the country maintain a massive database of cancer pathology reports that contain rich information for understanding cancer trends. However, these reports are stored in the form of unstructured text, and human annotators are required to read them and extract relevant information. In this article, we show that existing deep learning models for automating information extraction from cancer pathology reports can be significantly improved by using ensemble model distillation. We found that by training multiple predictive models and transferring their knowledge to a single, low-resource model, we can reduce the number of highly confident wrong predictions. Our results show that our implemented methods could save thousands of manual annotation hours.
Abstract: Objective: We aim to reduce overfitting and model overconfidence by distilling the knowledge of an ensemble of deep learning models into a single model for the classification of cancer pathology reports. Materials and Methods: We consider a text classification problem that involves 5 individual tasks. The baseline model consists of a multitask convolutional neural network (MtCNN), and the implemented ensemble (teacher) consists of 1000 MtCNNs. We performed knowledge transfer by training a single model (student) with soft labels derived through the aggregation of ensemble predictions. We evaluated performance based on accuracy and abstention rates using softmax thresholding. Results: The student model outperforms the baseline MtCNN in terms of abstention rates and accuracy, thereby allowing the model to be used with a larger volume of documents when deployed. The highest boost was observed for subsite and histology, for which the student model classified an additional 1.81% of reports for subsite and 3.33% of reports for histology. Discussion: Ensemble predictions provide a useful strategy for quantifying the uncertainty inherent in labeled data and thereby enable the construction of soft labels with estimated probabilities for multiple classes for a given document. Training models with the derived soft labels reduces model confidence in difficult-to-classify documents, leading to a reduction in the number of highly confident wrong predictions.
Conclusions: Ensemble model distillation is a simple tool for reducing model overconfidence in problems with extreme class imbalance and noisy datasets. These methods can facilitate the deployment of deep learning models in high-risk domains with low computational resources where minimizing inference time is required.
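The distillation recipe described in the abstract (aggregate ensemble softmax outputs into per-document soft labels, train a student against them, then abstain when the top softmax probability falls below a threshold) can be sketched as follows. This is an illustrative sketch on synthetic data, not the paper's implementation: the paper's teacher is 1000 MtCNNs, whereas here a small random ensemble stands in, and all sizes, biases, and the 0.35 threshold are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z, axis=-1):
    # Numerically stable softmax.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

# Illustrative sizes (the paper uses an ensemble of 1000 MtCNNs).
n_members, n_docs, n_classes = 50, 100, 4

# Synthetic member logits that loosely agree on a "true" class per document.
true = rng.integers(0, n_classes, size=n_docs)
bias = np.eye(n_classes)[true] * 2.0
logits = rng.normal(size=(n_members, n_docs, n_classes)) + bias

# Step 1: soft labels = mean of the members' softmax outputs per document.
soft_labels = softmax(logits).mean(axis=0)          # shape (n_docs, n_classes)

# Step 2 (not shown): the student model would be trained with cross-entropy
# against these soft labels instead of one-hot gold labels.

# Step 3: selective classification -- abstain whenever the top softmax
# probability is below a confidence threshold (threshold is illustrative).
threshold = 0.35
confidence = soft_labels.max(axis=1)
predictions = soft_labels.argmax(axis=1)
abstain = confidence < threshold
print(f"abstention rate: {abstain.mean():.2%}")
```

Because each member's softmax sums to 1, the averaged soft labels are themselves a valid probability distribution per document; documents the ensemble disagrees on end up with flatter soft labels, which is what lowers the student's confidence on hard cases.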
- Is Part Of:
- JAMIA open. Volume 5: Issue 3 (2022)
- Journal:
- JAMIA open
- Issue:
- Volume 5: Issue 3 (2022)
- Issue Display:
- Volume 5, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 5
- Issue:
- 3
- Issue Sort Value:
- 2022-0005-0003-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09-13
- Subjects:
- ensemble distillation -- CNN -- NLP -- deep learning -- selective classification
Medical informatics -- Periodicals
610.285
- Journal URLs:
- http://www.oxfordjournals.org/
https://academic.oup.com/jamiaopen
- DOI:
- 10.1093/jamiaopen/ooac075
- Languages:
- English
- ISSNs:
- 2574-2531
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 23243.xml