Impact of Missing Value Imputation on Classification for DNA Microarray Gene Expression Data—A Model-Based Study. (4th January 2010)
- Record Type:
- Journal Article
- Title:
- Impact of Missing Value Imputation on Classification for DNA Microarray Gene Expression Data—A Model-Based Study. (4th January 2010)
- Main Title:
- Impact of Missing Value Imputation on Classification for DNA Microarray Gene Expression Data—A Model-Based Study
- Authors:
- Sun, Youting
Braga-Neto, Ulisses
Dougherty, Edward R. - Other Names:
- Wang Yue Academic Editor.
- Abstract:
- Abstract : Many missing-value (MV) imputation methods have been developed for microarray data, but only a few studies have investigated the relationship between MV imputation and classification accuracy. Furthermore, these studies are problematic in fundamental steps such as MV generation and classifier error estimation. In this work, we carry out a model-based study that addresses some of the issues in previous studies. Six popular imputation algorithms, two feature selection methods, and three classification rules are considered. The results suggest that it is beneficial to apply MV imputation when the noise level is high, variance is small, or gene-cluster correlation is strong, under small to moderate MV rates. In these cases, if data quality metrics are available, then it may be helpful to consider the data point with poor quality as missing and apply one of the most robust imputation algorithms to estimate the true signal based on the available high-quality data points. However, at large MV rates, we conclude that imputation methods are not recommended. Regarding the MV rate, our results indicate the presence of a peaking phenomenon: performance of imputation methods actually improves initially as the MV rate increases, but after an optimum point, performance quickly deteriorates with increasing MV rates.
- Is Part Of:
- EURASIP journal on bioinformatics and systems biology. Volume 2009(2009)
- Journal:
- EURASIP journal on bioinformatics and systems biology
- Issue:
- Volume 2009(2009)
- Issue Display:
- Volume 2009, Issue 2009 (2009)
- Year:
- 2009
- Volume:
- 2009
- Issue:
- 2009
- Issue Sort Value:
- 2009-2009-2009-0000
- Page Start:
- Page End:
- Publication Date:
- 2010-01-04
- Subjects:
- Bioinformatics -- Periodicals
Systems biology -- Periodicals
Systems Biology
Signal Processing, Computer-Assisted
Bio-informatique
Biologie systémique
Bioinformatics
Systems biology
Systems Biology
Bioinformatics
Electronic journals
Periodical
Fulltext
Internet Resources
Periodicals
Periodicals
570.285 - Journal URLs:
- https://link.springer.com/journal/13637 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1155/2009/504069 ↗
- Languages:
- English
- ISSNs:
- 1687-4145
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10566.xml