An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking. (18th July 2016)
- Record Type:
- Journal Article
- Title:
- An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking. (18th July 2016)
- Main Title:
- An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking
- Authors:
- Saha, Sujay
Ghosh, Anupam
Seal, Dibyendu Bikash
Dey, Kashi Nath - Other Names:
- Ulutagay Gözde Academic Editor.
- Abstract:
- Abstract : Most of the gene expression data analysis algorithms require the entire gene expression matrix without any missing values. Hence, it is necessary to devise methods which would impute missing data values accurately. There exist a number of imputation algorithms to estimate those missing values. This work starts with a microarray dataset containing multiple missing values. We first apply the modified version of the fuzzy theory based existing method LRFDVImpute to impute multiple missing values of time series gene expression data and then validate the result of imputation by genetic algorithm (GA) based gene ranking methodology along with some regular statistical validation techniques, like RMSE method. Gene ranking, as far as our knowledge, has not been used yet to validate the result of missing value estimation. Firstly, the proposed method has been tested on the very popular Spellman dataset and results show that error margins have been drastically reduced compared to some previous works, which indirectly validates the statistical significance of the proposed method. Then it has been applied on four other 2-class benchmark datasets, like Colorectal Cancer tumours dataset (GDS4382), Breast Cancer dataset (GSE349-350), Prostate Cancer dataset, and DLBCL-FL (Leukaemia) for both missing value estimation and ranking the genes, and the results show that the proposed method can reach 100% classification accuracy with very few dominant genes, which indirectly validatesAbstract : Most of the gene expression data analysis algorithms require the entire gene expression matrix without any missing values. Hence, it is necessary to devise methods which would impute missing data values accurately. There exist a number of imputation algorithms to estimate those missing values. This work starts with a microarray dataset containing multiple missing values. We first apply the modified version of the fuzzy theory based existing method LRFDVImpute to impute multiple missing values of time series gene expression data and then validate the result of imputation by genetic algorithm (GA) based gene ranking methodology along with some regular statistical validation techniques, like RMSE method. Gene ranking, as far as our knowledge, has not been used yet to validate the result of missing value estimation. Firstly, the proposed method has been tested on the very popular Spellman dataset and results show that error margins have been drastically reduced compared to some previous works, which indirectly validates the statistical significance of the proposed method. Then it has been applied on four other 2-class benchmark datasets, like Colorectal Cancer tumours dataset (GDS4382), Breast Cancer dataset (GSE349-350), Prostate Cancer dataset, and DLBCL-FL (Leukaemia) for both missing value estimation and ranking the genes, and the results show that the proposed method can reach 100% classification accuracy with very few dominant genes, which indirectly validates the biological significance of the proposed method. … (more)
- Is Part Of:
- Advances in fuzzy systems. Volume 2016(2016)
- Journal:
- Advances in fuzzy systems
- Issue:
- Volume 2016(2016)
- Issue Display:
- Volume 2016, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 2016
- Issue:
- 2016
- Issue Sort Value:
- 2016-2016-2016-0000
- Page Start:
- Page End:
- Publication Date:
- 2016-07-18
- Subjects:
- Fuzzy systems -- Periodicals
Systèmes flous
Fuzzy systems
Periodicals
511.313 - Journal URLs:
- https://www.hindawi.com/journals/afs/ ↗
http://bibpurl.oclc.org/web/50278 ↗ - DOI:
- 10.1155/2016/6134736 ↗
- Languages:
- English
- ISSNs:
- 1687-7101
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 22847.xml