Detecting fake news by enhanced text representation with multi-EDU-structure awareness. (15th November 2022)
- Record Type:
- Journal Article
- Title:
- Detecting fake news by enhanced text representation with multi-EDU-structure awareness. (15th November 2022)
- Main Title:
- Detecting fake news by enhanced text representation with multi-EDU-structure awareness
- Authors:
- Wang, Yuhang
Wang, Li
Yang, Yanjie
Zhang, Yilin - Abstract:
- Abstract: Since fake news poses a serious threat to society and individuals, numerous studies have been brought by considering text, propagation and user profiles. Due to the data collection problem, these methods based on propagation and user profiles are less applicable in the early stages. A good alternative method is to detect news based on text as soon as they are released, and a lot of text-based methods were proposed, which usually utilized words, sentences or paragraphs as basic units. But, word is a too fine-grained unit to express coherent information well, sentence or paragraph is too coarse to show specific information. Which granularity is better and how to utilize it to enhance text representation for fake news detection are two key problems. In this paper, we introduce Elementary Discourse Unit (EDU) whose granularity is between word and sentence, and propose a multi-EDU-structure awareness model to improve text representation for fake news detection, namely EDU4FD. For the multi-EDU-structure awareness, we build the sequence-based EDU representations and the graph-based EDU representations. The former is gotten by modeling the coherence between consecutive EDUs with TextCNN that reflect the semantic coherence. For the latter, we first extract rhetorical relations to build the EDU dependency graph, which can show the global narrative logic and help deliver the main idea truthfully. Then a Relation Graph Attention Network (RGAT) is set to get the graph-basedAbstract: Since fake news poses a serious threat to society and individuals, numerous studies have been brought by considering text, propagation and user profiles. Due to the data collection problem, these methods based on propagation and user profiles are less applicable in the early stages. A good alternative method is to detect news based on text as soon as they are released, and a lot of text-based methods were proposed, which usually utilized words, sentences or paragraphs as basic units. But, word is a too fine-grained unit to express coherent information well, sentence or paragraph is too coarse to show specific information. Which granularity is better and how to utilize it to enhance text representation for fake news detection are two key problems. In this paper, we introduce Elementary Discourse Unit (EDU) whose granularity is between word and sentence, and propose a multi-EDU-structure awareness model to improve text representation for fake news detection, namely EDU4FD. For the multi-EDU-structure awareness, we build the sequence-based EDU representations and the graph-based EDU representations. The former is gotten by modeling the coherence between consecutive EDUs with TextCNN that reflect the semantic coherence. For the latter, we first extract rhetorical relations to build the EDU dependency graph, which can show the global narrative logic and help deliver the main idea truthfully. Then a Relation Graph Attention Network (RGAT) is set to get the graph-based EDU representation. Finally, the two EDU representations are incorporated as the enhanced text representation for fake news detection, using a gated recursive unit combined with a global attention mechanism. Experiments on four cross-source fake news datasets show that our model outperforms the state-of-the-art text-based methods. Our results suggest that considering EDU and its structural features could enhance the text representation for fake news detection. Highlights: Using EDU expressing content with an intermediate granularity to detect fake news. A sequence-based EDU representation is gotten by TextCNN. A graph-based EDU representation is obtained by EDU dependency graph and RGAT. Text representation is gotten by GRU with Global Attention mechanism. Extensive experiments on four cross-source datasets demonstrate the superiority. … (more)
- Is Part Of:
- Expert systems with applications. Volume 206(2022)
- Journal:
- Expert systems with applications
- Issue:
- Volume 206(2022)
- Issue Display:
- Volume 206, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 206
- Issue:
- 2022
- Issue Sort Value:
- 2022-0206-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-11-15
- Subjects:
- Fake news detection -- EDU -- Sequential structure -- Dependency graph structure -- TextCNN -- RGAT
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2022.117781 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23554.xml