Detecting gene-gene interactions using a permutation-based random forest method. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Detecting gene-gene interactions using a permutation-based random forest method. Issue 1 (December 2016)
- Main Title:
- Detecting gene-gene interactions using a permutation-based random forest method
- Authors:
- Li, Jing
Malley, James
Andrew, Angeline
Karagas, Margaret
Moore, Jason - Abstract:
- Abstract Background Identifying gene-gene interactions is essential to understand disease susceptibility and to detect genetic architectures underlying complex diseases. Here, we aimed at developing a permutation-based methodology relying on a machine learning method, random forest (RF), to detect gene-gene interactions. Our approach called permuted random forest (pRF) which identified the top interacting single nucleotide polymorphism (SNP) pairs by estimating how much the power of a random forest classification model is influenced by removing pairwise interactions. Results We systematically tested our approach on a simulation study with datasets possessing various genetic constraints including heritability, number of SNPs, sample size, etc. Our methodology showed high success rates for detecting the interaction SNP pair. We also applied our approach to two bladder cancer datasets, which showed consistent results with well-studied methodologies, such as multifactor dimensionality reduction (MDR) and statistical epistasis network (SEN). Furthermore, we built permuted random forest networks (PRFN), in which we used nodes to represent SNPs and edges to indicate interactions. Conclusions We successfully developed a scale-invariant methodology to detect pure gene-gene interactions based on permutation strategies and the machine learning method random forest. This methodology showed great potential to be used for detecting gene-gene interactions to study underlying geneticAbstract Background Identifying gene-gene interactions is essential to understand disease susceptibility and to detect genetic architectures underlying complex diseases. Here, we aimed at developing a permutation-based methodology relying on a machine learning method, random forest (RF), to detect gene-gene interactions. Our approach called permuted random forest (pRF) which identified the top interacting single nucleotide polymorphism (SNP) pairs by estimating how much the power of a random forest classification model is influenced by removing pairwise interactions. Results We systematically tested our approach on a simulation study with datasets possessing various genetic constraints including heritability, number of SNPs, sample size, etc. Our methodology showed high success rates for detecting the interaction SNP pair. We also applied our approach to two bladder cancer datasets, which showed consistent results with well-studied methodologies, such as multifactor dimensionality reduction (MDR) and statistical epistasis network (SEN). Furthermore, we built permuted random forest networks (PRFN), in which we used nodes to represent SNPs and edges to indicate interactions. Conclusions We successfully developed a scale-invariant methodology to detect pure gene-gene interactions based on permutation strategies and the machine learning method random forest. This methodology showed great potential to be used for detecting gene-gene interactions to study underlying genetic architectures in a scale-free way, which could be benefit to uncover the complex disease mechanisms. … (more)
- Is Part Of:
- Biodata mining. Volume 9:Issue 1(2016)
- Journal:
- Biodata mining
- Issue:
- Volume 9:Issue 1(2016)
- Issue Display:
- Volume 9, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2016-0009-0001-0000
- Page Start:
- 1
- Page End:
- 17
- Publication Date:
- 2016-12
- Subjects:
- Random forest -- GWAS -- Machine learning -- Scale invariant
Bioinformatics -- Periodicals
Computational biology -- Periodicals
Data mining -- Periodicals
570.285 - Journal URLs:
- http://www.biodatamining.org/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s13040-016-0093-5 ↗
- Languages:
- English
- ISSNs:
- 1756-0381
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9879.xml