Microblog summarization using Paragraph Vector and semantic structure. (September 2019)
- Record Type:
- Journal Article
- Title:
- Microblog summarization using Paragraph Vector and semantic structure. (September 2019)
- Main Title:
- Microblog summarization using Paragraph Vector and semantic structure
- Authors:
- Wang, Ruiyi
Luo, Senlin
Pan, Limin
Wu, Zhouting
Yuan, Yujiao
Chen, Qianrou
- Abstract:
- Highlights: Propose a microblog summarization method considering Paragraph Vector and semantic structure to ease feature sparseness. Abstract: Two fundamental difficulties still hinder the development of microblog summarization. The first is the feature sparseness of microblogs, which restricts the performance of sub-topic detection. The second is that sentence selection from sub-topics is based mainly on centrality approaches to measuring sentence salience, while the semantic features and relation features between sentences and sub-topics have received little attention. To address these two problems, we propose a summarization method considering Paragraph Vector and semantic structure. First, we use Paragraph Vector to construct a sentence similarity matrix that incorporates the contextual information of microblogs, and use it to detect sub-topics. Second, we analyze the sentences with the Chinese Sentential Semantic Model (CSM) to obtain semantic features; the relation features are then derived from the similarity matrix and these semantic features. Finally, the most informative sentences are selected from microblogs belonging to the same sub-topic using the semantic features and relation features. The experimental results show that the ROUGE-1 value is up to 53.17% with a 1.5% compression ratio. The results indicate that applying Paragraph Vector to microblog summarization can effectively improve sub-topic detection. Additionally, semantic features and relation features jointly enhance the summarization result. Furthermore, CSM provides a promising tool for sentence semantic analysis.
- Is Part Of:
- Computer speech & language. Volume 57(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 57(2019)
- Issue Display:
- Volume 57, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 57
- Issue:
- 2019
- Issue Sort Value:
- 2019-0057-2019-0000
- Page Start:
- 1
- Page End:
- 19
- Publication Date:
- 2019-09
- Subjects:
- Chinese Sentential Semantic Model -- Deep learning -- Language models -- Language parsing and understanding -- Microblog summarization
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
- Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.csl.2019.01.006
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 10443.xml
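
Illustrative note on the abstract: the abstract mentions a sentence similarity matrix built from sentence vectors and reports a ROUGE-1 score. The following is a minimal Python sketch of those two ideas, not the authors' implementation. The sentence vectors are hypothetical placeholders rather than real Paragraph Vector output, cosine similarity is an assumed choice of similarity measure, and the record does not say which ROUGE-1 variant the paper reports, so unigram recall is shown as one common form. The function names `cosine_similarity_matrix` and `rouge_1_recall` are made up for this example.

```python
import math
from collections import Counter


def cosine_similarity_matrix(vectors):
    """Return the pairwise cosine similarity matrix for a list of vectors."""
    norms = [math.sqrt(sum(x * x for x in v)) for v in vectors]
    matrix = []
    for i, u in enumerate(vectors):
        row = []
        for j, v in enumerate(vectors):
            dot = sum(a * b for a, b in zip(u, v))
            denom = norms[i] * norms[j]
            # Guard against zero-length vectors.
            row.append(dot / denom if denom else 0.0)
        matrix.append(row)
    return matrix


def rouge_1_recall(candidate_tokens, reference_tokens):
    """Unigram recall: clipped unigram overlap divided by reference length."""
    cand = Counter(candidate_tokens)
    ref = Counter(reference_tokens)
    overlap = sum(min(count, cand[tok]) for tok, count in ref.items())
    total = sum(ref.values())
    return overlap / total if total else 0.0


# Hypothetical sentence embeddings (placeholders, not Paragraph Vector output).
sentence_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
similarity = cosine_similarity_matrix(sentence_vectors)

# Toy reference and candidate summaries, tokenized on whitespace.
reference = "the cat sat on the mat".split()
candidate = "the cat lay on a mat".split()
score = rouge_1_recall(candidate, reference)
```

In practice the vectors would come from a trained sentence or paragraph embedding model, and sub-topics would be found by clustering over the similarity matrix; neither step is shown here because the record gives no details of how the paper does them.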