A hybrid lexicon-based and neural approach for explainable polarity detection. Issue 5 (September 2022)
- Record Type:
- Journal Article
- Title:
- A hybrid lexicon-based and neural approach for explainable polarity detection. Issue 5 (September 2022)
- Main Title:
- A hybrid lexicon-based and neural approach for explainable polarity detection
- Authors:
- Polignano, Marco
Basile, Valerio
Basile, Pierpaolo
Gabrieli, Giuliano
Vassallo, Marco
Bosco, Cristina - Abstract:
- Abstract: In this work, we propose BERT-WMAL, a hybrid model that brings together information coming from data through the recent transformer deep learning model and those obtained from a polarized lexicon. The result is a model for sentence polarity that manages to have performances comparable with those at the state-of-the-art, but with the advantage of being able to provide the end-user with an explanation regarding the most important terms involved with the provided prediction. The model has been evaluated on three polarity detection Italian dataset, i.e., SENTIPOLC, AGRITREND and ABSITA. While the first contains 7, 410 tweets released for training and 2, 000 for testing, the second and the third respectively include 1, 000 tweets without splitting, and 2, 365 reviews for training, 1, 171 for testing. The use of lexicon-based information proves to be effective in terms of the F1 measure since it shows an improvement of F1 score on all the observed dataset: from 0.664 to 0.669 (i.e, 0.772%) on AGRITREND, from 0.728 to 0.734 (i.e., 0.854%) on SENTIPOLC and from 0.904 to 0.921 (i.e, 1.873%) on ABSITA. The usefulness of this model not only depends on its effectiveness in terms of the F1 measure, but also on its ability to generate predictions that are more explainable and especially convincing for the end-users. We evaluated this aspect through a user study involving four native Italian speakers, each evaluating 64 sentences with associated explanations. The resultsAbstract: In this work, we propose BERT-WMAL, a hybrid model that brings together information coming from data through the recent transformer deep learning model and those obtained from a polarized lexicon. The result is a model for sentence polarity that manages to have performances comparable with those at the state-of-the-art, but with the advantage of being able to provide the end-user with an explanation regarding the most important terms involved with the provided prediction. The model has been evaluated on three polarity detection Italian dataset, i.e., SENTIPOLC, AGRITREND and ABSITA. While the first contains 7, 410 tweets released for training and 2, 000 for testing, the second and the third respectively include 1, 000 tweets without splitting, and 2, 365 reviews for training, 1, 171 for testing. The use of lexicon-based information proves to be effective in terms of the F1 measure since it shows an improvement of F1 score on all the observed dataset: from 0.664 to 0.669 (i.e, 0.772%) on AGRITREND, from 0.728 to 0.734 (i.e., 0.854%) on SENTIPOLC and from 0.904 to 0.921 (i.e, 1.873%) on ABSITA. The usefulness of this model not only depends on its effectiveness in terms of the F1 measure, but also on its ability to generate predictions that are more explainable and especially convincing for the end-users. We evaluated this aspect through a user study involving four native Italian speakers, each evaluating 64 sentences with associated explanations. The results demonstrate the validity of this approach based on a combination of weights of attention extracted from the deep learning model and the linguistic knowledge stored in the WMAL lexicon. These considerations allow us to regard the approach provided in this paper as a promising starting point for further works in this research area. Highlights: Definition of an affective lexicon for the Italian language, WMAL. Definition of a hybrid lexicon-deep learning classification model. Definition of an explanation strategy for justifying the classifications obtained. … (more)
- Is Part Of:
- Information processing & management. Volume 59:Issue 5(2022)
- Journal:
- Information processing & management
- Issue:
- Volume 59:Issue 5(2022)
- Issue Display:
- Volume 59, Issue 5 (2022)
- Year:
- 2022
- Volume:
- 59
- Issue:
- 5
- Issue Sort Value:
- 2022-0059-0005-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Sentiment analysis -- Polarity detection -- Lexicon -- WMAL -- BERT -- Explanation -- Deep learning -- Machine learning
Information storage and retrieval systems -- Periodicals
Information science -- Periodicals
Systèmes d'information -- Périodiques
Sciences de l'information -- Périodiques
Information science
Information storage and retrieval systems
Periodicals
658.4038 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064573 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ipm.2022.103058 ↗
- Languages:
- English
- ISSNs:
- 0306-4573
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4493.893000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23305.xml