A Semantic Embedding Enhanced Topic Model For User-Generated Textual Content Modeling In Social Ecosystems. (1st October 2022)
- Record Type:
- Journal Article
- Title:
- A Semantic Embedding Enhanced Topic Model For User-Generated Textual Content Modeling In Social Ecosystems. (1st October 2022)
- Main Title:
- A Semantic Embedding Enhanced Topic Model For User-Generated Textual Content Modeling In Social Ecosystems
- Authors:
- Zhang, Peng
Liu, Baoxi
Lu, Tun
Gu, Hansu
Ding, Xianghua
Gu, Ning
- Abstract:
- The development of Information and Communication Technologies (ICT) and Web 2.0 promotes the emergence of diverse social ecosystems like social Internet of Things (IoT), social media and online communities. User-generated textual content (UGTC), which consists of unstructured texts, is the most important and common type of user-generated content in social ecosystems. UGTC in social ecosystems is generated according to two types of context information: global context (topics) and local context (semantic regularities). For UGTC modeling, topic models just consider global context but ignore semantic regularities, while semantic embedding models are on the opposite. So only utilizing topic models or semantic embedding models to model UGTC suffers from some drawbacks. For this problem, we propose a semantic embedding enhanced topic model named SEE-Twitter-LDA for accurately modeling UGTC in social ecosystems. The core of SEE-Twitter-LDA is that words are generated according to mutual semantic information of topics and semantic regularities. So global context and local context are jointly considered for UGTC modeling. By utilizing 553 098 tweets sampled from Twitter and 211 233 posts sampled from Weibo, we validate SEE-Twitter-LDA's better performance on perplexity, topic divergence and topic coherence versus existing related models.
- Is Part Of:
- Computer journal. Volume 65:Number 11(2022)
- Journal:
- Computer journal
- Issue:
- Volume 65:Number 11(2022)
- Issue Display:
- Volume 65, Issue 11 (2022)
- Year:
- 2022
- Volume:
- 65
- Issue:
- 11
- Issue Sort Value:
- 2022-0065-0011-0000
- Page Start:
- 2953
- Page End:
- 2968
- Publication Date:
- 2022-10-01
- Subjects:
- Social Ecosystems -- User-generated Textual Content -- Topic Model -- Semantic Embedding -- Twitter -- Weibo
Computers -- Periodicals
005.1
- Journal URLs:
- http://comjnl.oxfordjournals.org/
http://ukcatalogue.oup.com/
- DOI:
- 10.1093/comjnl/bxac091
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 24771.xml