A Multi-view Kernel Clustering framework for Categorical sequences. (1st July 2022)
- Record Type:
- Journal Article
- Title:
- A Multi-view Kernel Clustering framework for Categorical sequences. (1st July 2022)
- Main Title:
- A Multi-view Kernel Clustering framework for Categorical sequences
- Authors:
- Xu, Kunpeng
Chen, Lifei
Wang, Shengrui - Abstract:
- Abstract: Multi-view clustering, which optimally integrates complementary information from different views to improve clustering performance, has drawn considerable attention in recent years. Despite recent advances, issues remain when dealing with data of high dimensionality and heterogeneity, especially in categorical sequences. These unique challenges and properties have motivated us to develop a novel M ulti-view K ernel C lustering framework for C ategorical sequences (MKCC), where views are expressed in terms of kernel matrices and a weighted combination of the instances is learned in parallel to the partitioning. Concretely, MKCC adaptively constructs the kernel matrix without the need of defining the kernel function. Nonetheless, the computational cost of storing the kernel matrix is O ( N 2 ) . To address this issue, we integrate a simple and efficient method for approximating the kernel matrix. A new multi-view clustering algorithm and a cluster validity index for categorical sequences are also proposed based on the framework. An empirical analysis on synthetic data sets and several commonly used real-world data sets demonstrates the appropriateness of the proposal, with the results showing the method's outstanding performance. Highlights: A multi-view kernel clustering framework for categorical sequences is proposed. This method constructs the kernel matrix directly from the sequence views. A cluster validity index is proposed for categorical sequences.Abstract: Multi-view clustering, which optimally integrates complementary information from different views to improve clustering performance, has drawn considerable attention in recent years. Despite recent advances, issues remain when dealing with data of high dimensionality and heterogeneity, especially in categorical sequences. These unique challenges and properties have motivated us to develop a novel M ulti-view K ernel C lustering framework for C ategorical sequences (MKCC), where views are expressed in terms of kernel matrices and a weighted combination of the instances is learned in parallel to the partitioning. Concretely, MKCC adaptively constructs the kernel matrix without the need of defining the kernel function. Nonetheless, the computational cost of storing the kernel matrix is O ( N 2 ) . To address this issue, we integrate a simple and efficient method for approximating the kernel matrix. A new multi-view clustering algorithm and a cluster validity index for categorical sequences are also proposed based on the framework. An empirical analysis on synthetic data sets and several commonly used real-world data sets demonstrates the appropriateness of the proposal, with the results showing the method's outstanding performance. Highlights: A multi-view kernel clustering framework for categorical sequences is proposed. This method constructs the kernel matrix directly from the sequence views. A cluster validity index is proposed for categorical sequences. Experimentation on synthetic and real-world data sets. … (more)
- Is Part Of:
- Expert systems with applications. Volume 197(2022)
- Journal:
- Expert systems with applications
- Issue:
- Volume 197(2022)
- Issue Display:
- Volume 197, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 197
- Issue:
- 2022
- Issue Sort Value:
- 2022-0197-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07-01
- Subjects:
- Multi-view clustering -- Kernel -- Categorical sequences -- Sample weighting -- Cluster validation
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2022.116637 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21273.xml