A split–merge clustering algorithm based on the k-nearest neighbor graph. Issue 111 (January 2023)
- Record Type:
- Journal Article
- Title:
- A split–merge clustering algorithm based on the k-nearest neighbor graph. Issue 111 (January 2023)
- Main Title:
- A split–merge clustering algorithm based on the k-nearest neighbor graph
- Authors:
- Wang, Yan
Ma, Yan
Huang, Hui
Wang, Bin
Acharjya, Debi Prasanna
- Abstract:
- Numerous graph-based clustering algorithms relying on k-nearest neighbor (KNN) have been proposed. However, the performance of these algorithms tends to be affected by many factors such as cluster shape, cluster density and outliers. To address these issues, we present a split–merge clustering algorithm based on the KNN graph (SMKNN), which is based on the idea that two adjacent clusters can be merged if the data points located in the connection layers of the two clusters tend to be consistent in distribution. In Stage 1, a KNN graph is constructed. In Stage 2, the subgraphs are obtained by removing the pivot points from the KNN graph, in which the pivot points are determined by the size of local distance ratio of data points. In Stage 3, the adjacent cluster pairs satisfying the maximum similarity are merged, in which the similarity measure of two clusters is designed with two concepts including external connection edges and internal connection edges. By the experiments on ten synthetic data sets and eight real data sets, we compared SMKNN with two traditional algorithms, two density-based algorithms, nine graph-based algorithms and four neural network based algorithms in accuracy. The experimental results demonstrate a good performance of the proposed clustering method.
- Is Part Of:
- Information systems. Issue 111(2023)
- Journal:
- Information systems
- Issue:
- Issue 111(2023)
- Issue Display:
- Volume 111, Issue 111 (2023)
- Year:
- 2023
- Volume:
- 111
- Issue:
- 111
- Issue Sort Value:
- 2023-0111-0111-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-01
- Subjects:
- K-nearest neighbor -- Clustering algorithm -- Split -- Merge -- Similarity measure
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7
- Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.is.2022.102124
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 24109.xml
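
Illustrative sketch (not part of the catalogue record, and not the authors' code): the abstract describes Stage 1 as building a KNN graph and Stage 2 as removing pivot points chosen by a local distance ratio. The Python below is a minimal, pure-stdlib sketch of those two stages only. The abstract does not define the ratio, so `local_distance_ratio` is an assumed stand-in (a point's mean KNN distance divided by the average mean KNN distance of its neighbors), and the function names, the `threshold` parameter and the toy data are all hypothetical. Stage 3 (merging by external and internal connection edges) is not attempted, because the abstract gives no formula for the similarity measure.

```python
import math


def knn_graph(points, k):
    """Stage 1 (sketch): brute-force k-nearest-neighbor lists.

    Returns (neighbors, dists): neighbors[i] holds the indices of the k
    points closest to points[i], dists[i] the matching distances.
    """
    neighbors, dists = [], []
    for i, p in enumerate(points):
        ranked = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )[:k]
        neighbors.append([j for _, j in ranked])
        dists.append([d for d, _ in ranked])
    return neighbors, dists


def local_distance_ratio(neighbors, dists):
    """ASSUMED definition, not taken from the paper: a point's mean KNN
    distance divided by the average mean KNN distance of its neighbors.
    Assumes no duplicate points (otherwise a mean distance can be zero)."""
    mean_d = [sum(d) / len(d) for d in dists]
    return [
        mean_d[i] / (sum(mean_d[j] for j in nbrs) / len(nbrs))
        for i, nbrs in enumerate(neighbors)
    ]


def split_subgraphs(neighbors, ratios, threshold):
    """Stage 2 (sketch): treat points whose ratio exceeds `threshold` as
    pivots, drop them, and return the connected components of the
    remaining KNN graph (edges taken as undirected)."""
    keep = {i for i, r in enumerate(ratios) if r <= threshold}
    adj = {i: set() for i in keep}
    for i in keep:
        for j in neighbors[i]:
            if j in keep:
                adj[i].add(j)
                adj[j].add(i)
    components, seen = [], set()
    for start in sorted(keep):
        if start in seen:
            continue
        stack, comp = [start], []
        seen.add(start)
        while stack:  # iterative depth-first search
            u = stack.pop()
            comp.append(u)
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
        components.append(sorted(comp))
    return components


# Toy data (hypothetical): two tight squares joined by one bridge point.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (5.5, 0.5)]
nbrs, dists = knn_graph(points, k=3)
ratios = local_distance_ratio(nbrs, dists)
subgraphs = split_subgraphs(nbrs, ratios, threshold=2.0)
print(subgraphs)
```

On this toy input the bridge point is far from its neighbors compared with how tightly those neighbors sit among themselves, so its ratio is high and removing it disconnects the two squares. With a very large threshold nothing is removed and the graph stays in one piece. Both the direction of the test and the threshold value are illustration choices, not values from the paper.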