A stacked autoencoder based gene selection and cancer classification framework. (September 2022)
- Record Type:
- Journal Article
- Title:
- A stacked autoencoder based gene selection and cancer classification framework. (September 2022)
- Main Title:
- A stacked autoencoder based gene selection and cancer classification framework
- Authors:
- Gokhale, Madhuri
Mohanty, Sraban Kumar
Ojha, Aparajita - Abstract:
- Abstract: Cancer is one of the most common causes of death worldwide and is, therefore, a prominent area of biomedical research. Cancer is a genetic disease in which improperly functioning genes tend to change expressions. Thus, gene expression analysis is utilized for early diagnosis of cancer prognosis, and therapy prediction in a clinical environment. Usually, some dominant genes among thousands of them play an important role in the diagnosis of cancer. But designing a suitable framework to find out the key set of genes is a challenging task. Numerous gene selection approaches have been introduced by researchers for cancer classification, using statistical, or traditional feature selection methods. In recent years, deep learning methods have also been applied for gene selection using autoencoder networks. However, improving the accuracy of cancer classification still remains a challenging task. In the present paper, a stacked autoencoder-based framework is proposed for gene selection and cancer classification. Nine different classifiers are employed to evaluate the performance of the gene selection model. Then the best performing combination of gene selection and cancer classification models are chosen to finally select the genes. Random Forest and Support Vector Machine show better performance on ten different benchmark datasets, when the gene selection is done using the stacked autoencoder. The classifier with the highest accuracy is selected to build the cancerAbstract: Cancer is one of the most common causes of death worldwide and is, therefore, a prominent area of biomedical research. Cancer is a genetic disease in which improperly functioning genes tend to change expressions. Thus, gene expression analysis is utilized for early diagnosis of cancer prognosis, and therapy prediction in a clinical environment. Usually, some dominant genes among thousands of them play an important role in the diagnosis of cancer. But designing a suitable framework to find out the key set of genes is a challenging task. Numerous gene selection approaches have been introduced by researchers for cancer classification, using statistical, or traditional feature selection methods. In recent years, deep learning methods have also been applied for gene selection using autoencoder networks. However, improving the accuracy of cancer classification still remains a challenging task. In the present paper, a stacked autoencoder-based framework is proposed for gene selection and cancer classification. Nine different classifiers are employed to evaluate the performance of the gene selection model. Then the best performing combination of gene selection and cancer classification models are chosen to finally select the genes. Random Forest and Support Vector Machine show better performance on ten different benchmark datasets, when the gene selection is done using the stacked autoencoder. The classifier with the highest accuracy is selected to build the cancer classification model. The proposed model outperforms seven existing methods on all the ten datasets. Highlights: A framework for gene selection and cancer classification is proposed that employs an autoencoder and a classifier from a bucket of nine classifiers. Selection of the autoencoder and related hyper parameters are done by an empirical study on ten different gene expression datasets. The number of most prominent genes is decided by iteratively decreasing the number of genes and finding the least possible number for effective cancer classification. The model selected through the proposed framework outperforms seven other existing models on all the ten datasets. Biological significance of the results has also been analyzed. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 78(2022)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 78(2022)
- Issue Display:
- Volume 78, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 78
- Issue:
- 2022
- Issue Sort Value:
- 2022-0078-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Gene selection -- Cancer classification -- Stacked autoencoder
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.103999 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23053.xml