A cluster-directed framework for neighbour based imputation of missing value in microarray data. (2016)
- Record Type:
- Journal Article
- Title:
- A cluster-directed framework for neighbour based imputation of missing value in microarray data. (2016)
- Main Title:
- A cluster-directed framework for neighbour based imputation of missing value in microarray data
- Authors:
- Keerin, Phimmarin
Kurutach, Werasak
Boongoen, Tossapon - Abstract:
- DNA microarray has been the most widely used functional genomics approach in bioinformatics. However, microarray data suffer from frequent missing values due to various experimental and data handling reasons. Leaving this unsolved may degrade the reliability of any consequent downstream analysis. As such, missing value imputation has been recognised as an important pre-processing step, which can yield the quality of data and its interpretation. Several techniques found in the literature have successfully exploited the characteristics and relations among a set of genes closest to the one under examination. However, the selection of so-called nearest neighbours is based simply on proximity between gene pairs, without taking the structural or grouping information into account. In response, this paper proposes a novel cluster-directed framework (CFNI: Cluster-directed Framework for Neighbour-based Imputation), in which data clustering is uniquely used to guide the identification of nearest neighbours. This allows a more accurate imputed value to be derived. Not only it performs better than several benchmark methods on published microarray data sets; it is also generalised such that any neighbour-based imputation technique can be coupled with the proposed model. This has been successfully demonstrated with both single pass and iterative models.
- Is Part Of:
- International journal of data mining and bioinformatics. Volume 15:Number 2(2016)
- Journal:
- International journal of data mining and bioinformatics
- Issue:
- Volume 15:Number 2(2016)
- Issue Display:
- Volume 15, Issue 2 (2016)
- Year:
- 2016
- Volume:
- 15
- Issue:
- 2
- Issue Sort Value:
- 2016-0015-0002-0000
- Page Start:
- 165
- Page End:
- 193
- Publication Date:
- 2016
- Subjects:
- missing values -- imputation -- gene expression data -- data clustering -- regression -- nearest neighbour -- microarray data -- bioinformatics
Data mining -- Periodicals
Bioinformatics -- Periodicals
006.312 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijdmb ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1748-5673
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7812.xml