The impact of isolation kernel on agglomerative hierarchical clustering algorithms. (July 2023)
- Record Type:
- Journal Article
- Title:
- The impact of isolation kernel on agglomerative hierarchical clustering algorithms. (July 2023)
- Main Title:
- The impact of isolation kernel on agglomerative hierarchical clustering algorithms
- Authors:
- Han, Xin
Zhu, Ye
Ting, Kai Ming
Li, Gang
- Abstract:
- Highlights: Providing the condition under which an AHC does not effectively extract clusters. Introducing the entanglement to measure the dendrogram quality. Identifying the root cause of a density bias of traditional AHC algorithms. Improving AHC performance with a data-dependent kernel.
Abstract: Agglomerative hierarchical clustering (AHC) is one of the popular clustering approaches. AHC generates a dendrogram that provides richer information and insights from a dataset than partitioning clustering. However, a major problem with existing distance-based AHC methods is that they fail to effectively identify adjacent clusters with varied densities, regardless of the cluster extraction method applied to the resultant dendrogram. This paper aims to reveal the root cause of this issue and to provide a solution by using a data-dependent kernel. We analyse the condition under which existing AHC methods fail to effectively extract clusters, and give the reason why the data-dependent kernel is an effective remedy. This leads to a new approach to kernelise existing hierarchical clustering algorithms, including the traditional AHC algorithms, HDBSCAN, GDL, PHA and HC-OT. Our extensive empirical evaluation shows that the recently introduced Isolation Kernel produces a higher quality or purer dendrogram than distance, Gaussian Kernel and adaptive Gaussian Kernel in all the above-mentioned AHC algorithms.
- Is Part Of:
- Pattern recognition. Volume 139(2023)
- Journal:
- Pattern recognition
- Issue:
- Volume 139(2023)
- Issue Display:
- Volume 139, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 139
- Issue:
- 2023
- Issue Sort Value:
- 2023-0139-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-07
- Subjects:
- Agglomerative hierarchical clustering -- Varied densities -- Dendrogram purity -- Isolation kernel -- Gaussian kernel
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4
- Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203
http://www.sciencedirect.com/
- DOI:
- 10.1016/j.patcog.2023.109517
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26855.xml