Detecting clickbaits using two-phase hybrid CNN-LSTM biterm model. (1st August 2020)
- Record Type:
- Journal Article
- Title:
- Detecting clickbaits using two-phase hybrid CNN-LSTM biterm model. (1st August 2020)
- Main Title:
- Detecting clickbaits using two-phase hybrid CNN-LSTM biterm model
- Authors:
- Kaur, Sawinder
Kumar, Parteek
Kumaraguru, Ponnurangam
- Abstract:
- Highlights: A ground dataset has been prepared from Facebook page and Reddit website. Extraction of text from non-textual (image-based) data using pre-processing approach. Automatic identification of the eight types of clickbait is done. A novel approach is proposed which works under two-phase structure. Shocking/Unbelievable, Hypothesis/Guess, Reaction types of clickbait are published maximum on social media.
Abstract: Clickbait indicates the type of content with the intended goal of attracting the attention of readers. It has grown to become a nuisance to social media users. The purpose of clickbait is to bring an appealing link in front of users. Clickbaits seen in the form of headlines make people curious to read the inside content. The text on clickbait posts is too short to identify its features as clickbait. In this paper, a novel approach (two-phase hybrid CNN-LSTM Biterm model) has been proposed for modeling short topic content. The hybrid CNN-LSTM model, when implemented with pre-trained GloVe embedding, yields the best results based on accuracy, recall, precision, and F1-score performance metrics. The proposed model achieves 91.24%, 95.64%, 95.87% precision values for Dataset 1, Dataset 2 and Dataset 3, respectively. Eight types of clickbait (Reasoning, Number, Reaction, Revealing, Shocking/Unbelievable, Hypothesis/Guess, Questionable, Forward referencing) are classified in this work using the Biterm Topic Model (BTM). It has been shown that clickbaits of the Shocking/Unbelievable, Hypothesis/Guess and Reaction types are the highest in number among the clickbait headlines published online. Also, a ground dataset of non-textual (image-based) data from multiple social media platforms has been created in this paper. The textual information has been retrieved from the images with the help of an OCR tool. A comparative study is performed to show the effectiveness of the proposed model, which helps to identify the various categories of clickbait headlines that are spread on social media platforms.
- Is Part Of:
- Expert systems with applications. Volume 151(2020)
- Journal:
- Expert systems with applications
- Issue:
- Volume 151(2020)
- Issue Display:
- Volume 151, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 151
- Issue:
- 2020
- Issue Sort Value:
- 2020-0151-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08-01
- Subjects:
- Clickbait -- News -- Classifier -- Features -- Social media
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33
- Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.eswa.2020.113350
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 13421.xml