An analysis of the effects of limited training data in distributed learning scenarios for brain age prediction. (26th October 2022)
- Record Type:
- Journal Article
- Title:
- An analysis of the effects of limited training data in distributed learning scenarios for brain age prediction. (26th October 2022)
- Main Title:
- An analysis of the effects of limited training data in distributed learning scenarios for brain age prediction
- Authors:
- Souza, Raissa
Mouches, Pauline
Wilms, Matthias
Tuladhar, Anup
Langner, Sönke
Forkert, Nils D - Abstract:
- Abstract: Objective: Distributed learning avoids problems associated with central data collection by training models locally at each site. This can be achieved by federated learning (FL) aggregating multiple models that were trained in parallel or training a single model visiting sites sequentially, the traveling model (TM). While both approaches have been applied to medical imaging tasks, their performance in limited local data scenarios remains unknown. In this study, we specifically analyze FL and TM performances when very small sample sizes are available per site. Materials and Methods: 2025 T1-weighted magnetic resonance imaging scans were used to investigate the effect of sample sizes on FL and TM for brain age prediction. We evaluated models across 18 scenarios varying the number of samples per site (1, 2, 5, 10, and 20) and the number of training rounds (20, 40, and 200). Results: Our results demonstrate that the TM outperforms FL, for every sample size examined. In the extreme case when each site provided only one sample, FL achieved a mean absolute error (MAE) of 18.9 ± 0.13 years, while the TM achieved a MAE of 6.21 ± 0.50 years, comparable to central learning (MAE = 5.99 years). Discussion: Although FL is more commonly used, our study demonstrates that TM is the best implementation for small sample sizes. Conclusion: The TM offers new opportunities to apply machine learning models in rare diseases and pediatric research but also allows even small hospitals toAbstract: Objective: Distributed learning avoids problems associated with central data collection by training models locally at each site. This can be achieved by federated learning (FL) aggregating multiple models that were trained in parallel or training a single model visiting sites sequentially, the traveling model (TM). While both approaches have been applied to medical imaging tasks, their performance in limited local data scenarios remains unknown. In this study, we specifically analyze FL and TM performances when very small sample sizes are available per site. Materials and Methods: 2025 T1-weighted magnetic resonance imaging scans were used to investigate the effect of sample sizes on FL and TM for brain age prediction. We evaluated models across 18 scenarios varying the number of samples per site (1, 2, 5, 10, and 20) and the number of training rounds (20, 40, and 200). Results: Our results demonstrate that the TM outperforms FL, for every sample size examined. In the extreme case when each site provided only one sample, FL achieved a mean absolute error (MAE) of 18.9 ± 0.13 years, while the TM achieved a MAE of 6.21 ± 0.50 years, comparable to central learning (MAE = 5.99 years). Discussion: Although FL is more commonly used, our study demonstrates that TM is the best implementation for small sample sizes. Conclusion: The TM offers new opportunities to apply machine learning models in rare diseases and pediatric research but also allows even small hospitals to contribute small datasets. Graphical Abstract: … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 30:Number 1(2023)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 30:Number 1(2023)
- Issue Display:
- Volume 30, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 30
- Issue:
- 1
- Issue Sort Value:
- 2023-0030-0001-0000
- Page Start:
- 112
- Page End:
- 119
- Publication Date:
- 2022-10-26
- Subjects:
- machine learning -- distributed learning -- brain age prediction
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocac204 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 24724.xml