Expressive modeling for trusted big data analytics: techniques and applications in sentiment analysis. Issue 1 (December 2017)
- Record Type:
- Journal Article
- Title:
- Expressive modeling for trusted big data analytics: techniques and applications in sentiment analysis. Issue 1 (December 2017)
- Main Title:
- Expressive modeling for trusted big data analytics: techniques and applications in sentiment analysis
- Authors:
- Tromp, Erik
Pechenizkiy, Mykola
Gaber, Mohamed - Abstract:
- Abstract Background Sentiment analysis becomes ubiquitous for a variety of applications used in marketing, commerce, and public sector. This has been raising a natural interest within the academic research and industry to develop approaches and solutions for ubiquitous sentiment analysis. However, we can observe that most of the academic research focuses on adopting state-of-the-art machine learning techniques for sentiment classification and elements of natural language processing for feature construction and evaluate them on benchmark datasets not regarding much the actual application settings. In industry the focus is on developing platforms, services and customized solutions for certain applications and for different domains. In this work we propose a generic framework for ubiquitous sentiment classification. We discuss the Rule-Based Emission Model (RBEM) algorithm that we employ for polarity detection. Results We show with the experimental results on benchmark datasets and real case studies that the proposed framework and RBEM approach for polarity detection are indeed generic and extendable. Conclusion As the state-of-the-art machine learning techniques produce black-box models for sentiment analysis, they are hard to fine-tune, debug and adapt to new domains. The necessity to collect lots of labeled data from a particular domain is another obstacle. Therefore, in industry rather simplistic approaches are adopted resulting potentially in poor accuracy. The proposedAbstract Background Sentiment analysis becomes ubiquitous for a variety of applications used in marketing, commerce, and public sector. This has been raising a natural interest within the academic research and industry to develop approaches and solutions for ubiquitous sentiment analysis. However, we can observe that most of the academic research focuses on adopting state-of-the-art machine learning techniques for sentiment classification and elements of natural language processing for feature construction and evaluate them on benchmark datasets not regarding much the actual application settings. In industry the focus is on developing platforms, services and customized solutions for certain applications and for different domains. In this work we propose a generic framework for ubiquitous sentiment classification. We discuss the Rule-Based Emission Model (RBEM) algorithm that we employ for polarity detection. Results We show with the experimental results on benchmark datasets and real case studies that the proposed framework and RBEM approach for polarity detection are indeed generic and extendable. Conclusion As the state-of-the-art machine learning techniques produce black-box models for sentiment analysis, they are hard to fine-tune, debug and adapt to new domains. The necessity to collect lots of labeled data from a particular domain is another obstacle. Therefore, in industry rather simplistic approaches are adopted resulting potentially in poor accuracy. The proposed framework for sentiment analysis allows to develop different solutions that are scalable, transparent, and easy to maintain. … (more)
- Is Part Of:
- Big data analytics. Volume 2:Issue 1(2017)
- Journal:
- Big data analytics
- Issue:
- Volume 2:Issue 1(2017)
- Issue Display:
- Volume 2, Issue 1 (2017)
- Year:
- 2017
- Volume:
- 2
- Issue:
- 1
- Issue Sort Value:
- 2017-0002-0001-0000
- Page Start:
- 1
- Page End:
- 28
- Publication Date:
- 2017-12
- Subjects:
- Sentiment analysis -- Trust in Big Data -- Rule-based mining
Big data -- Periodicals
Biology -- Data processing -- Periodicals
570.28557 - Journal URLs:
- https://bdataanalytics.biomedcentral.com/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s41044-016-0018-9 ↗
- Languages:
- English
- ISSNs:
- 2058-6345
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10012.xml