Random Indexing and Modified Random Indexing based approach for extractive text summarization. (January 2015)
- Record Type:
- Journal Article
- Title:
- Random Indexing and Modified Random Indexing based approach for extractive text summarization. (January 2015)
- Main Title:
- Random Indexing and Modified Random Indexing based approach for extractive text summarization
- Authors:
- Chatterjee, Niladri
Sahoo, Pramod Kumar - Abstract:
- Highlights: Three summarization techniques RISUM, RISUM+ and MRISUM are proposed. Cosine dissimilarity and Euclidean distance are used for proximity computation. Cosine dissimilarity unlike cosine similarity makes weighted PageRank to converge. MRISUM uses a convolution based scheme for context vector construction. MRISUM outperforms RISUM, RISUM+ and LSA+TRM. Abstract: Random Indexing based extractive text summarization has already been proposed in literature. This paper looks at the above technique in detail, and proposes several improvements. The improvements are both in terms of formation of index (word) vectors of the document, and construction of context vectors by using convolution instead of addition operation on the index vectors. Experiments have been conducted using both angular and linear distances as metrics for proximity. As a consequence, three improved versions of the algorithm, viz. RISUM, RISUM+ and MRISUM were obtained. These algorithms have been applied on DUC 2002 documents, and their comparative performance has been studied. Different ROUGE metrics have been used for performance evaluation. While RISUM and RISUM+ perform almost at par, MRISUM is found to outperform both RISUM and RISUM+ significantly. MRISUM also outperforms LSA+TRM based summarization approach. The study reveals that all the three Random Indexing based techniques proposed in this study produce consistent results when linear distance is used for measuring proximity.
- Is Part Of:
- Computer speech & language. Volume 29(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 29(2015)
- Issue Display:
- Volume 29, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 29
- Issue:
- 2015
- Issue Sort Value:
- 2015-0029-2015-0000
- Page Start:
- 32
- Page End:
- 44
- Publication Date:
- 2015-01
- Subjects:
- Word Space Model -- Random Indexing -- PageRank -- Convolution -- Modified Random Indexing
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2014.07.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5426.xml