A novel approach to text clustering using genetic algorithm based on the nearest neighbour heuristic. Issue 3 (4th March 2022)
- Record Type:
- Journal Article
- Title:
- A novel approach to text clustering using genetic algorithm based on the nearest neighbour heuristic. Issue 3 (4th March 2022)
- Main Title:
- A novel approach to text clustering using genetic algorithm based on the nearest neighbour heuristic
- Authors:
- Mustafi, D.
Mustafi, A.
Sahoo, G. - Abstract:
- Abstract : In this paper, we propose a novel clustering algorithm which uses a weighted combination of several criteria as its fitness function. We demonstrate the suitability of the new method in the case of clustering text documents. The proposed algorithm leverages the concept of nearest neighbour separation (NNS) to enhance the separation of the clusters and also outlines a heuristic to compute the NNS. A new parameterized fitness function has been proposed which can be tuned to provide more weightage to the traditional metrics based on inter- and intra-cluster distances of clusters or on the NNS. Genetic Algorithm has been used to perform the actual clustering and the results obtained has been compared with the traditional K-Means algorithm. The performance of the algorithm has been tested on different standard datasets, and the results have been presented.
- Is Part Of:
- International journal of computers and applications. Volume 44:Issue 3(2022)
- Journal:
- International journal of computers and applications
- Issue:
- Volume 44:Issue 3(2022)
- Issue Display:
- Volume 44, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 44
- Issue:
- 3
- Issue Sort Value:
- 2022-0044-0003-0000
- Page Start:
- 291
- Page End:
- 303
- Publication Date:
- 2022-03-04
- Subjects:
- K-means -- genetic algorithm (GA) -- nearest neighbour separation (NNS) -- document clustering -- document representation -- cluster purity
Computers -- Periodicals
Computer software -- Periodicals
Computer networks -- Periodicals
Multimedia systems -- Periodicals
Internet -- Periodicals
World Wide Web -- Periodicals
Minicomputers -- Periodicals
Microcomputers -- Periodicals
004.05 - Journal URLs:
- http://www.tandfonline.com/toc/tjca20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/1206212X.2020.1735035 ↗
- Languages:
- English
- ISSNs:
- 1206-212X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.175480
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 20994.xml