Weighted clustering: Towards solving the user's dilemma. (December 2021)
- Record Type:
- Journal Article
- Title:
- Weighted clustering: Towards solving the user's dilemma. (December 2021)
- Main Title:
- Weighted clustering: Towards solving the user's dilemma
- Authors:
- Ackerman, Margareta
Ben-David, Shai
Brânzei, Simina
Loker, David - Abstract:
- Highlights: Properties that help solve the clustering users dilemma. Weighted properties formally differentiate clustering methods. A formal classification highlights advantages of center-based clustering techniques. Abstract: This paper makes a major step towards addressing a long-standing challenge in cluster analysis, known as the user's dilemma, which is the problem of selecting an appropriate clustering algorithm for a specific task. A formal approach for addressing this challenge relies on the identification of succinct, user-friendly properties that capture formal differences amongst clustering techniques. While helpful for gaining insight into the nature of clustering paradigms, there is a theory-practice gap that has so far limited the utility of this approach: Formal properties typically highlight advantages of classical linkage-based algorithms, while practical experience shows that center-based methods are preferable for many applications. We present simple new properties that delineate core differences between common clustering paradigms and overcome this theory-practice gap. The properties we present give a formal understanding of the advantages of center-based approaches for some applications and insight into when different clustering paradigms should be used. These properties address how sensitive algorithms are to changes in element frequencies, which we capture in a generalized setting where every element is associated with a real-valued weight. ToHighlights: Properties that help solve the clustering users dilemma. Weighted properties formally differentiate clustering methods. A formal classification highlights advantages of center-based clustering techniques. Abstract: This paper makes a major step towards addressing a long-standing challenge in cluster analysis, known as the user's dilemma, which is the problem of selecting an appropriate clustering algorithm for a specific task. A formal approach for addressing this challenge relies on the identification of succinct, user-friendly properties that capture formal differences amongst clustering techniques. While helpful for gaining insight into the nature of clustering paradigms, there is a theory-practice gap that has so far limited the utility of this approach: Formal properties typically highlight advantages of classical linkage-based algorithms, while practical experience shows that center-based methods are preferable for many applications. We present simple new properties that delineate core differences between common clustering paradigms and overcome this theory-practice gap. The properties we present give a formal understanding of the advantages of center-based approaches for some applications and insight into when different clustering paradigms should be used. These properties address how sensitive algorithms are to changes in element frequencies, which we capture in a generalized setting where every element is associated with a real-valued weight. To complement extensive formal analysis, we discuss how these properties can be applied in practice. … (more)
- Is Part Of:
- Pattern recognition. Volume 120(2021)
- Journal:
- Pattern recognition
- Issue:
- Volume 120(2021)
- Issue Display:
- Volume 120, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 120
- Issue:
- 2021
- Issue Sort Value:
- 2021-0120-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-12
- Subjects:
- Clustering -- Theory -- Properties
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2021.108152 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18480.xml