Dynamic Visualization and Fast Computation for Convex Clustering via Algorithmic Regularization. Issue 1 (2nd January 2020)
- Record Type:
- Journal Article
- Title:
- Dynamic Visualization and Fast Computation for Convex Clustering via Algorithmic Regularization. Issue 1 (2nd January 2020)
- Main Title:
- Dynamic Visualization and Fast Computation for Convex Clustering via Algorithmic Regularization
- Authors:
- Weylandt, Michael
Nagorski, John
Allen, Genevera I. - Abstract:
- Abstract: Convex clustering is a promising new approach to the classical problem of clustering, combining strong performance in empirical studies with rigorous theoretical foundations. Despite these advantages, convex clustering has not been widely adopted, due to its computationally intensive nature and its lack of compelling visualizations. To address these impediments, we introduce Algorithmic Regularization, an innovative technique for obtaining high-quality estimates of regularization paths using an iterative one-step approximation scheme. We justify our approach with a novel theoretical result, guaranteeing global convergence of the approximate path to the exact solution under easily checked non-data-dependent assumptions. The application of algorithmic regularization to convex clustering yields the Convex Clustering via Algorithmic Regularization Paths (CARP) algorithm for computing the clustering solution path. On example datasets from genomics and text analysis, CARP delivers over a 100-fold speed-up over existing methods, while attaining a finer approximation grid than standard methods. Furthermore, CARP enables improved visualization of clustering solutions: the fine solution grid returned by CARP can be used to construct a convex clustering-based dendrogram, as well as forming the basis of a dynamic path-wise visualization based on modern web technologies. Our methods are implemented in the open-source R package clustRviz, available atAbstract: Convex clustering is a promising new approach to the classical problem of clustering, combining strong performance in empirical studies with rigorous theoretical foundations. Despite these advantages, convex clustering has not been widely adopted, due to its computationally intensive nature and its lack of compelling visualizations. To address these impediments, we introduce Algorithmic Regularization, an innovative technique for obtaining high-quality estimates of regularization paths using an iterative one-step approximation scheme. We justify our approach with a novel theoretical result, guaranteeing global convergence of the approximate path to the exact solution under easily checked non-data-dependent assumptions. The application of algorithmic regularization to convex clustering yields the Convex Clustering via Algorithmic Regularization Paths (CARP) algorithm for computing the clustering solution path. On example datasets from genomics and text analysis, CARP delivers over a 100-fold speed-up over existing methods, while attaining a finer approximation grid than standard methods. Furthermore, CARP enables improved visualization of clustering solutions: the fine solution grid returned by CARP can be used to construct a convex clustering-based dendrogram, as well as forming the basis of a dynamic path-wise visualization based on modern web technologies. Our methods are implemented in the open-source R package clustRviz, available at https://github.com/DataSlingers/clustRviz . Supplementary materials for this article are available online. … (more)
- Is Part Of:
- Journal of computational and graphical statistics. Volume 29:Issue 1(2020)
- Journal:
- Journal of computational and graphical statistics
- Issue:
- Volume 29:Issue 1(2020)
- Issue Display:
- Volume 29, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 29
- Issue:
- 1
- Issue Sort Value:
- 2020-0029-0001-0000
- Page Start:
- 87
- Page End:
- 96
- Publication Date:
- 2020-01-02
- Subjects:
- Algorithmic regularization -- Clustering -- Convex clustering -- Dendrograms -- Optimization -- Visualization
Mathematical statistics -- Data processing -- Periodicals
Mathematical statistics -- Graphic methods -- Periodicals
519.50285 - Journal URLs:
- http://pubs.amstat.org/loi/jcgs ↗
http://www.catchword.com/titles/10857117.htm ↗
http://www.tandf.co.uk/journals/titles/10618600.asp ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/10618600.2019.1629943 ↗
- Languages:
- English
- ISSNs:
- 1061-8600
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4963.451000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13794.xml