A qualitative analysis of sarcasm, irony and related #hashtags on Twitter. Issue 2 (November 2020)
- Record Type:
- Journal Article
- Title:
- A qualitative analysis of sarcasm, irony and related #hashtags on Twitter. Issue 2 (November 2020)
- Main Title:
- A qualitative analysis of sarcasm, irony and related #hashtags on Twitter
- Authors:
- Sykora, Martin
Elayan, Suzanne
Jackson, Thomas W - Abstract:
- As the use of automated social media analysis tools surges, concerns over accuracy of analytics have increased. Some tentative evidence suggests that sarcasm alone could account for as much as a 50% drop in accuracy when automatically detecting sentiment. This paper assesses and outlines the prevalence of sarcastic and ironic language within social media posts. Several past studies proposed models for automatic sarcasm and irony detection for sentiment analysis; however, these approaches result in models trained on training data of highly questionable quality, with little qualitative appreciation of the underlying data. To understand the issues and scale of the problem, we are the first to conduct and present results of a focused manual semantic annotation analysis of two datasets of Twitter messages (in total 4334 tweets), associated with; (i) hashtags commonly employed in automated sarcasm and irony detection approaches, and (ii) tweets relating to 25 distinct events, including, scandals, product releases, cultural events, accidents, terror incidents, etc. We also highlight the contextualised use of multi-word hashtags in the communication of humour, sarcasm and irony, pointing out that many sentiment analysis tools simply fail to recognise such hashtag-based expressions. Our findings also offer indicative evidence regarding the quality of training data used for automated machine learning models in sarcasm, irony and sentiment detection. Worryingly only 15% of tweetsAs the use of automated social media analysis tools surges, concerns over accuracy of analytics have increased. Some tentative evidence suggests that sarcasm alone could account for as much as a 50% drop in accuracy when automatically detecting sentiment. This paper assesses and outlines the prevalence of sarcastic and ironic language within social media posts. Several past studies proposed models for automatic sarcasm and irony detection for sentiment analysis; however, these approaches result in models trained on training data of highly questionable quality, with little qualitative appreciation of the underlying data. To understand the issues and scale of the problem, we are the first to conduct and present results of a focused manual semantic annotation analysis of two datasets of Twitter messages (in total 4334 tweets), associated with; (i) hashtags commonly employed in automated sarcasm and irony detection approaches, and (ii) tweets relating to 25 distinct events, including, scandals, product releases, cultural events, accidents, terror incidents, etc. We also highlight the contextualised use of multi-word hashtags in the communication of humour, sarcasm and irony, pointing out that many sentiment analysis tools simply fail to recognise such hashtag-based expressions. Our findings also offer indicative evidence regarding the quality of training data used for automated machine learning models in sarcasm, irony and sentiment detection. Worryingly only 15% of tweets labelled as sarcastic were truly sarcastic. We highlight the need for future research studies to rethink their approach to data preparation and a more careful interpretation of sentiment analysis. … (more)
- Is Part Of:
- Big data & society. Volume 7:Issue 2(2020)
- Journal:
- Big data & society
- Issue:
- Volume 7:Issue 2(2020)
- Issue Display:
- Volume 7, Issue 2 (2020)
- Year:
- 2020
- Volume:
- 7
- Issue:
- 2
- Issue Sort Value:
- 2020-0007-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-11
- Subjects:
- Social media -- sarcasm -- irony -- sentiment analysis -- Twitter
Big data -- Social aspects -- Periodicals
Social sciences -- Research -- Data processing -- Periodicals
Social sciences -- Research -- Methodology -- Periodicals
Data mining -- Periodicals
300.28557 - Journal URLs:
- http://bds.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.1177/2053951720972735 ↗
- Languages:
- English
- ISSNs:
- 2053-9517
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14492.xml