A hybrid k-prototypes clustering approach with improved sine-cosine algorithm for mixed-data classification. (July 2022)
- Record Type:
- Journal Article
- Title:
- A hybrid k-prototypes clustering approach with improved sine-cosine algorithm for mixed-data classification. (July 2022)
- Main Title:
- A hybrid k-prototypes clustering approach with improved sine-cosine algorithm for mixed-data classification
- Authors:
- Kuo, Timothy
Wang, Kung-Jeng - Abstract:
- Highlights: Develop a novel clustering-based classification algorithm for mixed data. Use a sine–cosine algorithm (SCA) to find attribute weights and initial centroids. The objective function of the algorithm is formulated as a sum-up purity. Four mutation strategies are embedded into the original SCA. The results showed that the proposed algorithm can achieve superior accuracy. Abstract: When dealing a classification problem with mixed data, most of conventional supervised learning algorithms cannot perform well due to their numerical characteristics. However, some clustering algorithms, such as k -prototypes algorithm, show their potential in clustering mixed data. Therefore, the current study intends to develop a novel clustering-based classification algorithm for mixed data to have both merits of classification and clustering. The proposed algorithm employs a sine-cosine algorithm (SCA) to find attribute weights and initial centroids for a k -prototypes algorithm. The objective function of the algorithm is formulated as a sum-up purity. To have better performance for SCA, a mutation strategy, containing Gaussian mutation, Cauchy mutation, Levy mutation, and single-point mutation, is embedded into the original SCA. The proposed algorithm is compared with some metaheuristic-based classification algorithms and existing classification algorithms. Based on the 10 data sets from UCI, the experimental results indicated that the proposed algorithm can achieve superiorHighlights: Develop a novel clustering-based classification algorithm for mixed data. Use a sine–cosine algorithm (SCA) to find attribute weights and initial centroids. The objective function of the algorithm is formulated as a sum-up purity. Four mutation strategies are embedded into the original SCA. The results showed that the proposed algorithm can achieve superior accuracy. Abstract: When dealing a classification problem with mixed data, most of conventional supervised learning algorithms cannot perform well due to their numerical characteristics. However, some clustering algorithms, such as k -prototypes algorithm, show their potential in clustering mixed data. Therefore, the current study intends to develop a novel clustering-based classification algorithm for mixed data to have both merits of classification and clustering. The proposed algorithm employs a sine-cosine algorithm (SCA) to find attribute weights and initial centroids for a k -prototypes algorithm. The objective function of the algorithm is formulated as a sum-up purity. To have better performance for SCA, a mutation strategy, containing Gaussian mutation, Cauchy mutation, Levy mutation, and single-point mutation, is embedded into the original SCA. The proposed algorithm is compared with some metaheuristic-based classification algorithms and existing classification algorithms. Based on the 10 data sets from UCI, the experimental results indicated that the proposed algorithm can achieve superior classification performance in terms of accuracy and Cohen's Kappa. In addition, mutation mechanism can make SCA have better performance. … (more)
- Is Part Of:
- Computers & industrial engineering. Volume 169(2022)
- Journal:
- Computers & industrial engineering
- Issue:
- Volume 169(2022)
- Issue Display:
- Volume 169, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 169
- Issue:
- 2022
- Issue Sort Value:
- 2022-0169-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07
- Subjects:
- Classification -- Clustering -- Clustering-based classification -- Mixed data -- Genetic algorithm -- Sine-cosine algorithm -- Mutation
Engineering -- Data processing -- Periodicals
Industrial engineering -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03608352 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cie.2022.108164 ↗
- Languages:
- English
- ISSNs:
- 0360-8352
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.713000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22113.xml