A data mining framework based on boundary-points for gene selection from DNA-microarrays: Pancreatic Ductal Adenocarcinoma as a case study. (April 2018)
- Record Type:
- Journal Article
- Title:
- A data mining framework based on boundary-points for gene selection from DNA-microarrays: Pancreatic Ductal Adenocarcinoma as a case study. (April 2018)
- Main Title:
- A data mining framework based on boundary-points for gene selection from DNA-microarrays: Pancreatic Ductal Adenocarcinoma as a case study
- Authors:
- Ramos, Juan
Castellanos-Garzón, José A.
de Paz, Juan F.
Corchado, Juan M. - Abstract:
- Abstract: Gene selection (or feature selection) from DNA-microarray data can be focused on different techniques, which generally involve statistical tests, data mining and machine learning. In recent years there has been an increasing interest in using hybrid-technique sets to face the problem of meaningful gene selection; nevertheless, this issue remains a challenge. In an effort to address the situation, this paper proposes a novel hybrid framework based on data mining techniques and tuned to select gene subsets, which are meaningfully related to the target disease conducted in DNA-microarray experiments. For this purpose, the framework above deals with approaches such as statistical significance tests, cluster analysis, evolutionary computation, visual analytics and boundary points. The latter is the core technique of our proposal, allowing the framework to define two methods of gene selection. Another novelty of this work is the inclusion of the age of patients as an additional factor in our analysis, which can leading to gaining more insight into the disease. In fact, the results reached in this research have been very promising and have shown their biological validity. Hence, our proposal has resulted in a methodology that can be followed in the gene selection process from DNA-microarray data. Highlights: A data mining framework for gene selection. Application of clustering and boundary points to gene selection. Two gene selection methods (boundary intersection andAbstract: Gene selection (or feature selection) from DNA-microarray data can be focused on different techniques, which generally involve statistical tests, data mining and machine learning. In recent years there has been an increasing interest in using hybrid-technique sets to face the problem of meaningful gene selection; nevertheless, this issue remains a challenge. In an effort to address the situation, this paper proposes a novel hybrid framework based on data mining techniques and tuned to select gene subsets, which are meaningfully related to the target disease conducted in DNA-microarray experiments. For this purpose, the framework above deals with approaches such as statistical significance tests, cluster analysis, evolutionary computation, visual analytics and boundary points. The latter is the core technique of our proposal, allowing the framework to define two methods of gene selection. Another novelty of this work is the inclusion of the age of patients as an additional factor in our analysis, which can leading to gaining more insight into the disease. In fact, the results reached in this research have been very promising and have shown their biological validity. Hence, our proposal has resulted in a methodology that can be followed in the gene selection process from DNA-microarray data. Highlights: A data mining framework for gene selection. Application of clustering and boundary points to gene selection. Two gene selection methods (boundary intersection and evolutionary method). Pancreatic Ductal Adenocarcinoma as a case study (DNA-microarray data). Two informative gene subsets found, which can be used in biomarker research. … (more)
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 70(2017:Oct.)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 70(2017:Oct.)
- Issue Display:
- Volume 70 (2017)
- Year:
- 2017
- Volume:
- 70
- Issue Sort Value:
- 2017-0070-0000-0000
- Page Start:
- 92
- Page End:
- 108
- Publication Date:
- 2018-04
- Subjects:
- Feature selection -- Gene selection -- Data mining -- Cluster analysis -- Evolutionary computation -- Boundary point -- DNA-microarray -- Visual analytics -- Filter method -- Boundary gene
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2018.01.007 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5947.xml