Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets. (July 2023)
- Record Type:
- Journal Article
- Title:
- Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets. (July 2023)
- Main Title:
- Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets
- Authors:
- Zhao, Jia
Wang, Gang
Pan, Jeng-Shyang
Fan, Tanghuai
Lee, Ivan - Abstract:
- Highlights: A new DPC algorithm for uneven density datasets is proposed. A new local density calculation method based on fuzzy neighborhood is designed. A new allocation strategy based on weighted shared nearest neighbor is proposed. The new DPC algorithm has excellent clustering accuracy for different types of datasets. Abstract: Uneven density data refers to data with a certain difference in sample density between clusters. The local density of density peaks clustering algorithm (DPC) does not consider the effect of sample density difference between clusters of uneven density data, which may lead to wrong selection of cluster centers; the algorithm allocation strategy makes it easy to incorrectly allocate samples originally belonging to sparse clusters to dense clusters, which reduces clustering efficiency. In this study, we proposed the density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets (DPC-FWSN). First, a nearest neighbor fuzzy kernel function is obtained by combining K-nearest neighbor and fuzzy neighborhood. Then, local density is redefined by the nearest neighbor fuzzy kernel function. The local density can better characterize the distribution characteristics of the sample by balancing the contribution of sample density in dense and sparse areas, in order to avoid the situation that the sparse cluster does not have a cluster center. Finally, the allocation strategy for weighted shared neighbor similarity isHighlights: A new DPC algorithm for uneven density datasets is proposed. A new local density calculation method based on fuzzy neighborhood is designed. A new allocation strategy based on weighted shared nearest neighbor is proposed. The new DPC algorithm has excellent clustering accuracy for different types of datasets. Abstract: Uneven density data refers to data with a certain difference in sample density between clusters. The local density of density peaks clustering algorithm (DPC) does not consider the effect of sample density difference between clusters of uneven density data, which may lead to wrong selection of cluster centers; the algorithm allocation strategy makes it easy to incorrectly allocate samples originally belonging to sparse clusters to dense clusters, which reduces clustering efficiency. In this study, we proposed the density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets (DPC-FWSN). First, a nearest neighbor fuzzy kernel function is obtained by combining K-nearest neighbor and fuzzy neighborhood. Then, local density is redefined by the nearest neighbor fuzzy kernel function. The local density can better characterize the distribution characteristics of the sample by balancing the contribution of sample density in dense and sparse areas, in order to avoid the situation that the sparse cluster does not have a cluster center. Finally, the allocation strategy for weighted shared neighbor similarity is proposed to optimize the sample allocation at the boundary of the sparse cluster. Experiments are performed on IDPC-FA, FKNN-DPC, FNDPC, DPCSA and DPC for uneven density datasets, complex morphologies datasets and real datasets. The clustering results demonstrate that DPC-FWSN effectively handles datasets with uneven density distribution. … (more)
- Is Part Of:
- Pattern recognition. Volume 139(2023)
- Journal:
- Pattern recognition
- Issue:
- Volume 139(2023)
- Issue Display:
- Volume 139, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 139
- Issue:
- 2023
- Issue Sort Value:
- 2023-0139-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-07
- Subjects:
- Uneven density data -- Density peaks clustering -- Fuzzy neighborhood -- K-nearest neighbor -- Weighted shared neighbor
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2023.109406 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26769.xml