Graph-based relevancy-redundancy gene selection method for cancer diagnosis. (August 2022)
- Record Type:
- Journal Article
- Title:
- Graph-based relevancy-redundancy gene selection method for cancer diagnosis. (August 2022)
- Main Title:
- Graph-based relevancy-redundancy gene selection method for cancer diagnosis
- Authors:
- Azadifar, Saeid
Rostami, Mehrdad
Berahmand, Kamal
Moradi, Parham
Oussalah, Mourad - Abstract:
- Abstract: Nowadays, microarray data processing is one of the most important applications in molecular biology for cancer diagnosis. A major task in microarray data processing is gene selection, which aims to find a subset of genes with the least inner similarity and most relevant to the target class. Removing unnecessary, redundant, or noisy data reduces the data dimensionality. This research advocates a graph theoretic-based gene selection method for cancer diagnosis. Both unsupervised and supervised modes use well-known and successful social network approaches such as the maximum weighted clique criterion and edge centrality to rank genes. The suggested technique has two goals: (i) to maximize the relevancy of the chosen genes with the target class and (ii) to reduce their inner redundancy. A maximum weighted clique is chosen in a repetitive way in each iteration of this procedure. The appropriate genes are then chosen from among the existing features in this maximum clique using edge centrality and gene relevance. In the experiment, several datasets consisting of Colon, Leukemia, SRBCT, Prostate Tumor, and Lung Cancer, with different properties, are used to demonstrate the efficacy of the developed model. Our performance is compared to that of renowned filter-based gene selection approaches for cancer diagnosis whose results demonstrate a clear superiority. Graphical abstract: Image 1 Highlights: In this paper, a novel graph theoretic-based gene selection method isAbstract: Nowadays, microarray data processing is one of the most important applications in molecular biology for cancer diagnosis. A major task in microarray data processing is gene selection, which aims to find a subset of genes with the least inner similarity and most relevant to the target class. Removing unnecessary, redundant, or noisy data reduces the data dimensionality. This research advocates a graph theoretic-based gene selection method for cancer diagnosis. Both unsupervised and supervised modes use well-known and successful social network approaches such as the maximum weighted clique criterion and edge centrality to rank genes. The suggested technique has two goals: (i) to maximize the relevancy of the chosen genes with the target class and (ii) to reduce their inner redundancy. A maximum weighted clique is chosen in a repetitive way in each iteration of this procedure. The appropriate genes are then chosen from among the existing features in this maximum clique using edge centrality and gene relevance. In the experiment, several datasets consisting of Colon, Leukemia, SRBCT, Prostate Tumor, and Lung Cancer, with different properties, are used to demonstrate the efficacy of the developed model. Our performance is compared to that of renowned filter-based gene selection approaches for cancer diagnosis whose results demonstrate a clear superiority. Graphical abstract: Image 1 Highlights: In this paper, a novel graph theoretic-based gene selection method is developed for cancer diagnosis. The main goal of this method is to maximize the relevancy of the chosen genes and reduce their inner redundancy. In this proposed method the optimal number of the final gene set is determined automatically. Several cancer datasets are used to demonstrate the efficacy of the developed model. Our results on cancer datasets indicate that our model is more accurate than previous models in predicting cancer. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 147(2022)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 147(2022)
- Issue Display:
- Volume 147, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 147
- Issue:
- 2022
- Issue Sort Value:
- 2022-0147-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-08
- Subjects:
- Gene selection -- Cancer diagnosis -- Maximum clique -- Edge centrality -- Social network analysis
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2022.105766 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22279.xml