Developing a standardized protocol for computational sentiment analysis research using health-related social media data. (22nd December 2020)
- Record Type:
- Journal Article
- Title:
- Developing a standardized protocol for computational sentiment analysis research using health-related social media data. (22nd December 2020)
- Main Title:
- Developing a standardized protocol for computational sentiment analysis research using health-related social media data
- Authors:
- He, Lu
Yin, Tingjue
Hu, Zhaoxian
Chen, Yunan
Hanauer, David A
Zheng, Kai - Abstract:
- Abstract: Objective: Sentiment analysis is a popular tool for analyzing health-related social media content. However, existing studies exhibit numerous methodological issues and inconsistencies with respect to research design and results reporting, which could lead to biased data, imprecise or incorrect conclusions, or incomparable results across studies. This article reports a systematic analysis of the literature with respect to such issues. The objective was to develop a standardized protocol for improving the research validity and comparability of results in future relevant studies. Materials and Methods: We developed the Protocol of Analysis of senTiment in Health (PATH) based on a systematic review that analyzed common research design choices and how such choices were made, or reported, among eligible studies published 2010-2019. Results: Of 409 articles screened, 89 met the inclusion criteria. A total of 16 distinctive research design choices were identified, 9 of which have significant methodological or reporting inconsistencies among the articles reviewed, ranging from how relevance of study data was determined to how the sentiment analysis tool selected was validated. Based on this result, we developed the PATH protocol that encompasses all these distinctive design choices and highlights the ones for which careful consideration and detailed reporting are particularly warranted. Conclusions: A substantial degree of methodological and reporting inconsistencies existAbstract: Objective: Sentiment analysis is a popular tool for analyzing health-related social media content. However, existing studies exhibit numerous methodological issues and inconsistencies with respect to research design and results reporting, which could lead to biased data, imprecise or incorrect conclusions, or incomparable results across studies. This article reports a systematic analysis of the literature with respect to such issues. The objective was to develop a standardized protocol for improving the research validity and comparability of results in future relevant studies. Materials and Methods: We developed the Protocol of Analysis of senTiment in Health (PATH) based on a systematic review that analyzed common research design choices and how such choices were made, or reported, among eligible studies published 2010-2019. Results: Of 409 articles screened, 89 met the inclusion criteria. A total of 16 distinctive research design choices were identified, 9 of which have significant methodological or reporting inconsistencies among the articles reviewed, ranging from how relevance of study data was determined to how the sentiment analysis tool selected was validated. Based on this result, we developed the PATH protocol that encompasses all these distinctive design choices and highlights the ones for which careful consideration and detailed reporting are particularly warranted. Conclusions: A substantial degree of methodological and reporting inconsistencies exist in the extant literature that applied sentiment analysis to analyzing health-related social media data. The PATH protocol developed through this research may contribute to mitigating such issues in future relevant studies. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 28:Number 6(2021)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 28:Number 6(2021)
- Issue Display:
- Volume 28, Issue 6 (2021)
- Year:
- 2021
- Volume:
- 28
- Issue:
- 6
- Issue Sort Value:
- 2021-0028-0006-0000
- Page Start:
- 1125
- Page End:
- 1134
- Publication Date:
- 2020-12-22
- Subjects:
- sentiment analysis -- reference standard [E05.978.808] -- social media [L01.178.75] -- user-generated content -- Web 2.0 -- Facebook -- Twitter -- Instagram -- natural language processing [L01.224.050.375.580] -- computing methodologies [L01.224] -- machine learning [G17.035.250.500]
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocaa298 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 17232.xml