Informative top-k class associative rule for cancer biomarker discovery on microarray data. (15th May 2020)
- Record Type:
- Journal Article
- Title:
- Informative top-k class associative rule for cancer biomarker discovery on microarray data. (15th May 2020)
- Main Title:
- Informative top-k class associative rule for cancer biomarker discovery on microarray data
- Authors:
- Ong, Huey Fang
Mustapha, Norwati
Hamdan, Hazlina
Rosli, Rozita
Mustapha, Aida - Abstract:
- Highlights: Biomarker discovery on microarray data for colorectal cancer and breast cancer. Integration with gene ontology, KEGG pathways, and protein-protein interactions. Rule interestingness based on information gain, accuracy, and enrichment score. Information gain and associative classification produce higher prediction. Significant reproducibility and interpretability of discovered genes. Abstract: The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule ( i TCAR) method in an integrative framework for identifying candidate genes of specific cancers. i TCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological information from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers.Highlights: Biomarker discovery on microarray data for colorectal cancer and breast cancer. Integration with gene ontology, KEGG pathways, and protein-protein interactions. Rule interestingness based on information gain, accuracy, and enrichment score. Information gain and associative classification produce higher prediction. Significant reproducibility and interpretability of discovered genes. Abstract: The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule ( i TCAR) method in an integrative framework for identifying candidate genes of specific cancers. i TCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological information from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers. The experimental results show that i TCAR has excellent predictability by achieving the average classification accuracy above 90% and the average area under the curve above 0.8. Besides, i TCAR has significant reproducibility and interpretability through functional enrichment analysis and retrieval of meaningful cancer terms. These promising results suggest the proposed method has great potential in identifying candidate genes, which can be further investigated as biomarkers for cancer diseases. … (more)
- Is Part Of:
- Expert systems with applications. Volume 146(2020)
- Journal:
- Expert systems with applications
- Issue:
- Volume 146(2020)
- Issue Display:
- Volume 146, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 146
- Issue:
- 2020
- Issue Sort Value:
- 2020-0146-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-05-15
- Subjects:
- Microarray gene expression -- Associative classification -- Information gain -- Biomarker discovery -- Colorectal cancer -- Breast cancer
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2019.113169 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12914.xml