A systematic literature review of machine learning in online personal health data. (25th March 2019)
- Record Type:
- Journal Article
- Title:
- A systematic literature review of machine learning in online personal health data. (25th March 2019)
- Main Title:
- A systematic literature review of machine learning in online personal health data
- Authors:
- Yin, Zhijun
Sulieman, Lina M
Malin, Bradley A - Abstract:
- Abstract: Objective: User-generated content (UGC) in online environments provides opportunities to learn an individual's health status outside of clinical settings. However, the nature of UGC brings challenges in both data collecting and processing. The purpose of this study is to systematically review the effectiveness of applying machine learning (ML) methodologies to UGC for personal health investigations. Materials and Methods: We searched PubMed, Web of Science, IEEE Library, ACM library, AAAI library, and the ACL anthology. We focused on research articles that were published in English and in peer-reviewed journals or conference proceedings between 2010 and 2018. Publications that applied ML to UGC with a focus on personal health were identified for further systematic review. Results: We identified 103 eligible studies which we summarized with respect to 5 research categories, 3 data collection strategies, 3 gold standard dataset creation methods, and 4 types of features applied in ML models. Popular off-the-shelf ML models were logistic regression ( n = 22), support vector machines ( n = 18), naive Bayes ( n = 17), ensemble learning ( n = 12), and deep learning ( n = 11). The most investigated problems were mental health ( n = 39) and cancer ( n = 15). Common health-related aspects extracted from UGC were treatment experience, sentiments and emotions, coping strategies, and social support. Conclusions: The systematic review indicated that ML canAbstract: Objective: User-generated content (UGC) in online environments provides opportunities to learn an individual's health status outside of clinical settings. However, the nature of UGC brings challenges in both data collecting and processing. The purpose of this study is to systematically review the effectiveness of applying machine learning (ML) methodologies to UGC for personal health investigations. Materials and Methods: We searched PubMed, Web of Science, IEEE Library, ACM library, AAAI library, and the ACL anthology. We focused on research articles that were published in English and in peer-reviewed journals or conference proceedings between 2010 and 2018. Publications that applied ML to UGC with a focus on personal health were identified for further systematic review. Results: We identified 103 eligible studies which we summarized with respect to 5 research categories, 3 data collection strategies, 3 gold standard dataset creation methods, and 4 types of features applied in ML models. Popular off-the-shelf ML models were logistic regression ( n = 22), support vector machines ( n = 18), naive Bayes ( n = 17), ensemble learning ( n = 12), and deep learning ( n = 11). The most investigated problems were mental health ( n = 39) and cancer ( n = 15). Common health-related aspects extracted from UGC were treatment experience, sentiments and emotions, coping strategies, and social support. Conclusions: The systematic review indicated that ML can be effectively applied to UGC in facilitating the description and inference of personal health. Future research needs to focus on mitigating bias introduced when building study cohorts, creating features from free text, improving clinical creditability of UGC, and model interpretability. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 26:Number 6(2019)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 26:Number 6(2019)
- Issue Display:
- Volume 26, Issue 6 (2019)
- Year:
- 2019
- Volume:
- 26
- Issue:
- 6
- Issue Sort Value:
- 2019-0026-0006-0000
- Page Start:
- 561
- Page End:
- 576
- Publication Date:
- 2019-03-25
- Subjects:
- systematic review -- machine learning -- online environment -- online health community -- social media -- patient portal -- personal health
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocz009 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15152.xml