A centroid-based gene selection method for microarray data classification. (7th July 2016)
- Record Type:
- Journal Article
- Title:
- A centroid-based gene selection method for microarray data classification. (7th July 2016)
- Main Title:
- A centroid-based gene selection method for microarray data classification
- Authors:
- Guo, Shun
Guo, Donghui
Chen, Lifei
Jiang, Qingshan - Abstract:
- Abstract: For classification problems based on microarray data, the data typically contains a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed to choose the best subset of features for microarray data with the irrelevant and redundant features removed. We formulate the selection problem as a L1-regularized optimization problem, based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid to define both the between-class separability and the within-class compactness for the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached with a general condition, on which an efficient algorithm is derived to the feature selection problem in a linear time complexity with respect to the number of features and the number of samples. The experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods. Highlights: A gene selection method is proposed by using centroid-based discriminant criterion. Kernel-based expectation is used to estimate the class centroid. An efficient algorithm is proposed for the L1-regularized optimization problem. Theoretical analysis indicates that the global optimal solution can be reached. Our method has linearAbstract: For classification problems based on microarray data, the data typically contains a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed to choose the best subset of features for microarray data with the irrelevant and redundant features removed. We formulate the selection problem as a L1-regularized optimization problem, based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid to define both the between-class separability and the within-class compactness for the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached with a general condition, on which an efficient algorithm is derived to the feature selection problem in a linear time complexity with respect to the number of features and the number of samples. The experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods. Highlights: A gene selection method is proposed by using centroid-based discriminant criterion. Kernel-based expectation is used to estimate the class centroid. An efficient algorithm is proposed for the L1-regularized optimization problem. Theoretical analysis indicates that the global optimal solution can be reached. Our method has linear complexity with the number of genes and samples. … (more)
- Is Part Of:
- Journal of theoretical biology. Volume 400(2016)
- Journal:
- Journal of theoretical biology
- Issue:
- Volume 400(2016)
- Issue Display:
- Volume 400, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 400
- Issue:
- 2016
- Issue Sort Value:
- 2016-0400-2016-0000
- Page Start:
- 32
- Page End:
- 41
- Publication Date:
- 2016-07-07
- Subjects:
- Class centroid -- Microarray data -- Classification -- L1 regularization -- Gene selection
Biology -- Periodicals
Biological Science Disciplines -- Periodicals
Biology -- Periodicals
Biologie -- Périodiques
Theoretische biologie
Biology
Periodicals
571.05 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00225193/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.jtbi.2016.03.034 ↗
- Languages:
- English
- ISSNs:
- 0022-5193
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5069.075000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1294.xml