FLAKE: Fuzzy Graph Centrality-based Automatic Keyword Extraction. (5th December 2020)
- Record Type:
- Journal Article
- Title:
- FLAKE: Fuzzy Graph Centrality-based Automatic Keyword Extraction. (5th December 2020)
- Main Title:
- FLAKE: Fuzzy Graph Centrality-based Automatic Keyword Extraction
- Authors:
- Jain, Amita
Mittal, Kanika
Vaisla, Kunwar Singh - Abstract:
- Abstract: Keyword extraction is one of the most important aspects of text mining. Keywords help in identifying the document context. Many researchers have contributed their work to keyword extraction. They proposed approaches based on the frequency of occurrence, the position of words or the similarity between two terms. However, these approaches have shown shortcomings. In this paper, we propose a method that tries to overcome some of these shortcomings and present a new algorithm whose efficiency has been evaluated against widely used benchmarks. It is found from the analysis of standard datasets that the position of word in the document plays an important role in the identification of keywords. In this paper, a fuzzy logic-based automatic keyword extraction (FLAKE) method is proposed. FLAKE assigns weights to the keywords by considering the relative position of each word in the entire document as well as in the sentence coupled with the total occurrences of that word in the document. Based on the above data, candidate keywords are selected. Using WordNet, a fuzzy graph is constructed whose nodes represent candidate keywords. At this point, the most important nodes (based on fuzzy graph centrality measures) are identified. Those important nodes are selected as final keywords. The experiments conducted on various datasets show that proposed approach outperforms other keyword extraction methodologies by enhancing precision and recall.
- Is Part Of:
- Computer journal. Volume 65:Number 4(2022)
- Journal:
- Computer journal
- Issue:
- Volume 65:Number 4(2022)
- Issue Display:
- Volume 65, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 65
- Issue:
- 4
- Issue Sort Value:
- 2022-0065-0004-0000
- Page Start:
- 926
- Page End:
- 939
- Publication Date:
- 2020-12-05
- Subjects:
- fuzzy graph centrality -- information retrieval -- keyword extraction -- tf-idf
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxaa133 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21290.xml