Using the two-population genetic algorithm with distance-based k-nearest neighbour voting classifier for high-dimensional data. (2016)
- Record Type:
- Journal Article
- Title:
- Using the two-population genetic algorithm with distance-based k-nearest neighbour voting classifier for high-dimensional data. (2016)
- Main Title:
- Using the two-population genetic algorithm with distance-based k-nearest neighbour voting classifier for high-dimensional data
- Authors:
- Lee, Chien-Pang
Lin, Wen-Shin - Abstract:
- Owing to developments in computer technology, high-dimensional data has become a popular research issue. However, the traditional statistical methods cannot perform well when the variable numbers (p) are greater than the sample size (n). Accordingly, this paper proposes a novel hybrid model that combines statistical methodology with data mining techniques for the classification of high-dimensional data. In the proposed model, the Fisher's least significant difference test was originally used for initial dimension reduction. Subsequently, this paper uses a two-population genetic algorithms and a non-parametric statistics classification method (distance-based k-nearest neighbour voting classifier) to evaluate and to rank the variables' importance. Furthermore, the evaluation of the relevant variables for classification is considered with the outlier detection method. Eight different public gene expression datasets are used to compare the performance of the proposed model with the existing methods. The experimental results indicate that the proposed model performs better than the existing methods in terms of the classification accuracy.
- Is Part Of:
- International journal of data mining and bioinformatics. Volume 14:Number 4(2016)
- Journal:
- International journal of data mining and bioinformatics
- Issue:
- Volume 14:Number 4(2016)
- Issue Display:
- Volume 14, Issue 4 (2016)
- Year:
- 2016
- Volume:
- 14
- Issue:
- 4
- Issue Sort Value:
- 2016-0014-0004-0000
- Page Start:
- 315
- Page End:
- 331
- Publication Date:
- 2016
- Subjects:
- genetic algorithms -- k-nearest neighbour -- Fisher& -- #39 -- s least significant difference -- outlier detection -- high-dimensional data -- gene expression data -- data classification -- bioinformatics -- data mining
Data mining -- Periodicals
Bioinformatics -- Periodicals
006.312 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijdmb ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1748-5673
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7813.xml