Web document summarisation using pointwise mutual information (PMI) from web resources. (17th December 2022)
- Record Type:
- Journal Article
- Title:
- Web document summarisation using pointwise mutual information (PMI) from web resources. (17th December 2022)
- Main Title:
- Web document summarisation using pointwise mutual information (PMI) from web resources
- Authors:
- Srivastava, Atul Kumar
Pandey, Dhiraj
Aggarwal, Alok
Gupta, Sunil - Abstract:
- Nowadays, large amount of data is generated over the internet. It is impossible for the humans to summarise such large chunks of bytes. Therefore, to deal with such challenges, automatic text summarisation systems are deployed. Text Summarisation is the field of data mining that highlights the relevance of important text in a document. In this paper, we proposed a web-based text summarisation approach that generates good quality summary based on total pointwise mutual information (TPMI) scores of the sentences. A sample document from DUC dataset is used which is pre-processed for tokenisation, stop words removal and stemming operations. Based on the extracted words, the TPMI is estimated by calculating the pointwise mutual information (PMI) of the occurrences of words on web search engine. To provide evidence for the robustness of our proposed system, proposed approach is compared with the well-known text summarisation techniques based on sentence length and mean score. The results show that our method outperforms the other techniques by exhibiting best results for closest mean score and generating good quality summary on sentences of different length.
- Is Part Of:
- International journal of system of systems engineering. Volume 12:Number 4(2022)
- Journal:
- International journal of system of systems engineering
- Issue:
- Volume 12:Number 4(2022)
- Issue Display:
- Volume 12, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 12
- Issue:
- 4
- Issue Sort Value:
- 2022-0012-0004-0000
- Page Start:
- 329
- Page End:
- 353
- Publication Date:
- 2022-12-17
- Subjects:
- document summarisation -- text summarisation -- PMI -- point-wise mutual information
003.71 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijsse#issue ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1748-0671
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 24248.xml