Ultrametric fitting by gradient descent*This article is an updated version of: Chierchia G and Perret B 2019 Ultrametric fitting by gradient descent 33rd Conf. Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada vol 32 eds H Wallach, H Larochelle, A Beygelzimer, F d'Alché-Buc, E Fox and R Garnett pp 3181–92. (21st December 2020)
- Record Type:
- Journal Article
- Title:
- Ultrametric fitting by gradient descent*This article is an updated version of: Chierchia G and Perret B 2019 Ultrametric fitting by gradient descent 33rd Conf. Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada vol 32 eds H Wallach, H Larochelle, A Beygelzimer, F d'Alché-Buc, E Fox and R Garnett pp 3181–92. (21st December 2020)
- Main Title:
- Ultrametric fitting by gradient descent*This article is an updated version of: Chierchia G and Perret B 2019 Ultrametric fitting by gradient descent 33rd Conf. Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada vol 32 eds H Wallach, H Larochelle, A Beygelzimer, F d'Alché-Buc, E Fox and R Garnett pp 3181–92
- Authors:
- Chierchia, Giovanni
Perret, Benjamin - Abstract:
- Abstract: We study the problem of fitting an ultrametric distance to a dissimilarity graph in the context of hierarchical cluster analysis. Standard hierarchical clustering methods are specified procedurally, rather than in terms of the cost function to be optimized. We aim to overcome this limitation by presenting a general optimization framework for ultrametric fitting. Our approach consists of modeling the latter as a constrained optimization problem over the continuous space of ultrametrics. So doing, we can leverage the simple, yet effective, idea of replacing the ultrametric constraint with a min–max operation injected directly into the cost function. The proposed reformulation leads to an unconstrained optimization problem that can be efficiently solved by gradient descent methods. The flexibility of our framework allows us to investigate several cost functions, following the classic paradigm of combining a data fidelity term with a regularization. While we provide no theoretical guarantee to find the global optimum, the numerical results obtained over a number of synthetic and real datasets demonstrate the good performance of our approach with respect to state-of-the-art agglomerative algorithms. This makes us believe that the proposed framework sheds new light on the way to design a new generation of hierarchical clustering methods. Our code is made publicly available at https://github.com/PerretB/ultrametric-fitting .
- Is Part Of:
- Journal of statistical mechanics. (2020:Dec.)
- Journal:
- Journal of statistical mechanics
- Issue:
- (2020:Dec.)
- Issue Display:
- Volume 1000072 (2020)
- Year:
- 2020
- Volume:
- 1000072
- Issue Sort Value:
- 2020-1000072-0000-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-12-21
- Subjects:
- machine learning
Statistical mechanics -- Periodicals
Mechanics -- Statistical methods -- Periodicals
530.1305 - Journal URLs:
- http://ioppublishing.org/ ↗
- DOI:
- 10.1088/1742-5468/abc62d ↗
- Languages:
- English
- ISSNs:
- 1742-5468
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15293.xml