A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data. (15th March 2019)

Record Type:: Journal Article
Title:: A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data. (15th March 2019)
Main Title:: A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data
Authors:: Hussain, Syed Fawad
Haris, Muhammad
Abstract:: Highlights: A probabilistic random walk model for the 3 steps of the k-means algorithm. Mathematical foundation for efficacy and proofs for convergence are given. Clustering/co-clustering results show robustness, convergence and high accuracy. Abstract: The k-means algorithm is a widely used method that starts with an initial partitioning of the data and then iteratively converges towards the local solution by reducing the Sum of Squared Errors (SSE). It is known to suffer from the cluster center initialization problem and the iterative step simply (re-)labels the data points based on the initial partition. Most improvements to k-means proposed in the literature focus on the initialization step alone but make no attempt to guide the iterative convergence by exploiting statistical information from the data. Using higher order statistics (such as paths from random walks in a graph) and the duality in the data (as in co-clustering), for instance, are known ways to improve the clustering results. What is unique and significant in our proposed approach is that we embed these concepts into the k-means algorithm rather than just using them as an external distance measure and present a unified framework called the k-means based co-clustering (kCC) algorithm. The initialization step has been modified to include multiple points to represent each cluster center such that points within a cluster are close together but are far from points representing other clusters. Moreover, …
Is Part Of:: Expert systems with applications. Volume 118 (2019)
Journal:: Expert systems with applications
Issue:: Volume 118 (2019)
Issue Display:: Volume 118, Issue 2019 (2019)
Year:: 2019
Volume:: 118
Issue:: 2019
Issue Sort Value:: 2019-0118-2019-0000
Page Start:: 20
Page End:: 34
Publication Date:: 2019-03-15
Subjects:: Clustering -- K-means -- Centroid initialization -- Co-clustering -- Semantic similarity
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33
Journal URLs:: http://www.sciencedirect.com/science/journal/09574174
http://www.elsevier.com/journals
DOI:: 10.1016/j.eswa.2018.09.006
Languages:: English
ISSNs:: 0957-4174
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 14213.xml
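
Illustrative note: the abstract describes standard k-means as an initial partitioning followed by iterative relabelling that reduces the Sum of Squared Errors (SSE). The sketch below shows only that plain baseline loop (Lloyd-style k-means), not the kCC algorithm, whose random-walk and co-clustering steps are not specified in this record. The function names, the random choice of initial centers, and the `seed` and `max_iter` parameters are assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple

Point = Sequence[float]


def squared_distance(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(
    points: List[Point], k: int, max_iter: int = 100, seed: int = 0
) -> Tuple[List[int], List[List[float]], float]:
    """Plain k-means baseline: returns (labels, centers, SSE)."""
    rng = random.Random(seed)
    # Step 1 (initialization): pick k distinct data points as starting centers.
    # This random choice is an assumption; it is the step the abstract says
    # k-means is sensitive to.
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [-1] * len(points)

    for _ in range(max_iter):
        # Step 2 (assignment): label each point with its nearest center.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # no label changed, so the partition is a local solution
        labels = new_labels

        # Step 3 (update): move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster becomes empty
                centers[j] = [sum(c) / len(members) for c in zip(*members)]

    # SSE: total squared distance from each point to its assigned center.
    sse = sum(squared_distance(p, centers[lab]) for p, lab in zip(points, labels))
    return labels, centers, sse


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    labels, centers, sse = kmeans(data, k=2)
    print(labels, centers, sse)
```

Because the loop only relabels points relative to the current centers, a poor starting partition can leave it in a poor local solution, which is the limitation the abstract says the proposed approach addresses.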