A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions. (30th January 2014)
- Record Type:
- Journal Article
- Title:
- A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions. (30th January 2014)
- Main Title:
- A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions
- Authors:
- Wiens, Jenna
Guttag, John
Horvitz, Eric - Abstract:
- Abstract: Background Data-driven risk stratification models built using data from a single hospital often have a paucity of training data. However, leveraging data from other hospitals can be challenging owing to institutional differences with patients and with data coding and capture. Objective To investigate three approaches to learning hospital-specific predictions about the risk of hospital-associated infection with Clostridium difficile, and perform a comparative analysis of the value of different ways of using external data to enhance hospital-specific predictions. Materials and methods We evaluated each approach on 132 853 admissions from three hospitals, varying in size and location. The first approach was a single-task approach, in which only training data from the target hospital (ie, the hospital for which the model was intended) were used. The second used only data from the other two hospitals. The third approach jointly incorporated data from all hospitals while seeking a solution in the target space. Results The relative performance of the three different approaches was found to be sensitive to the hospital selected as the target. However, incorporating data from all hospitals consistently had the highest performance. Discussion The results characterize the challenges and opportunities that come with (1) using data or models from collections of hospitals without adapting them to the site at which the model will be used, and (2) using only local data to buildAbstract: Background Data-driven risk stratification models built using data from a single hospital often have a paucity of training data. However, leveraging data from other hospitals can be challenging owing to institutional differences with patients and with data coding and capture. Objective To investigate three approaches to learning hospital-specific predictions about the risk of hospital-associated infection with Clostridium difficile, and perform a comparative analysis of the value of different ways of using external data to enhance hospital-specific predictions. Materials and methods We evaluated each approach on 132 853 admissions from three hospitals, varying in size and location. The first approach was a single-task approach, in which only training data from the target hospital (ie, the hospital for which the model was intended) were used. The second used only data from the other two hospitals. The third approach jointly incorporated data from all hospitals while seeking a solution in the target space. Results The relative performance of the three different approaches was found to be sensitive to the hospital selected as the target. However, incorporating data from all hospitals consistently had the highest performance. Discussion The results characterize the challenges and opportunities that come with (1) using data or models from collections of hospitals without adapting them to the site at which the model will be used, and (2) using only local data to build models for small institutions or rare events. Conclusions We show how external data from other hospitals can be successfully and efficiently incorporated into hospital-specific models. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 21:Number 4(2014:Jul.)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 21:Number 4(2014:Jul.)
- Issue Display:
- Volume 21, Issue 4 (2014)
- Year:
- 2014
- Volume:
- 21
- Issue:
- 4
- Issue Sort Value:
- 2014-0021-0004-0000
- Page Start:
- 699
- Page End:
- 706
- Publication Date:
- 2014-01-30
- Subjects:
- transfer learning -- c. difficile -- risk stratification -- electronic health records -- predictive models
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1136/amiajnl-2013-002162 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15161.xml