Practical aspects of gene regulatory inference via conditional inference forests from expression data. Issue 8 (11th November 2016)
- Record Type:
- Journal Article
- Title:
- Practical aspects of gene regulatory inference via conditional inference forests from expression data. Issue 8 (11th November 2016)
- Main Title:
- Practical aspects of gene regulatory inference via conditional inference forests from expression data
- Authors:
- Bessonov, Kyrylo
Van Steen, Kristel - Abstract:
- ABSTRACT: Gene regulatory network (GRN) inference is an active area of research that facilitates understanding the complex interplays between biological molecules. We propose a novel framework to create such GRNs, based on Conditional Inference Forests ( CIF s) as proposed by Strobl et al. Our framework consists of using ensembles of Conditional Inference Trees ( CIT s) and selecting an appropriate aggregation scheme for variant selection prior to network construction. We show on synthetic microarray data that taking the original implementation of CIF s with conditional permutation scheme ( CIFcond ) may lead to improved performance compared to Breiman's implementation of Random Forests ( RF ). Among all newly introduced CIF‐ based methods and five network scenarios obtained from the DREAM4 challenge, CIFcond performed best. Networks derived from well‐tuned CIF s, obtained by simply averaging P ‐values over tree ensembles ( CIFmean ) are particularly attractive, because they combine adequate performance with computational efficiency. Moreover, thresholds for variable selection are based on significance levels for P ‐values and, hence, do not need to be tuned. From a practical point of view, our extensive simulations show the potential advantages of CIFmean ‐based methods. Although more work is needed to improve on speed, especially when fully exploiting the advantages of CIT s in the context of heterogeneous and correlated data, we have shown that CIF methodology can beABSTRACT: Gene regulatory network (GRN) inference is an active area of research that facilitates understanding the complex interplays between biological molecules. We propose a novel framework to create such GRNs, based on Conditional Inference Forests ( CIF s) as proposed by Strobl et al. Our framework consists of using ensembles of Conditional Inference Trees ( CIT s) and selecting an appropriate aggregation scheme for variant selection prior to network construction. We show on synthetic microarray data that taking the original implementation of CIF s with conditional permutation scheme ( CIFcond ) may lead to improved performance compared to Breiman's implementation of Random Forests ( RF ). Among all newly introduced CIF‐ based methods and five network scenarios obtained from the DREAM4 challenge, CIFcond performed best. Networks derived from well‐tuned CIF s, obtained by simply averaging P ‐values over tree ensembles ( CIFmean ) are particularly attractive, because they combine adequate performance with computational efficiency. Moreover, thresholds for variable selection are based on significance levels for P ‐values and, hence, do not need to be tuned. From a practical point of view, our extensive simulations show the potential advantages of CIFmean ‐based methods. Although more work is needed to improve on speed, especially when fully exploiting the advantages of CIT s in the context of heterogeneous and correlated data, we have shown that CIF methodology can be flexibly inserted in a framework to infer biological interactions. Notably, we confirmed biologically relevant interaction between IL2RA and FOXP1, linked to the IL‐2 signaling pathway and to type 1 diabetes. … (more)
- Is Part Of:
- Genetic epidemiology. Volume 40:Issue 8(2016)
- Journal:
- Genetic epidemiology
- Issue:
- Volume 40:Issue 8(2016)
- Issue Display:
- Volume 40, Issue 8 (2016)
- Year:
- 2016
- Volume:
- 40
- Issue:
- 8
- Issue Sort Value:
- 2016-0040-0008-0000
- Page Start:
- 767
- Page End:
- 778
- Publication Date:
- 2016-11-11
- Subjects:
- biological interactions -- conditional inference forests -- gene regulatory networks
Genetic epidemiology -- Periodicals
Heredity -- Periodicals
Medical geography -- Periodicals
614 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-2272 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/gepi.22017 ↗
- Languages:
- English
- ISSNs:
- 0741-0395
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4111.848000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 483.xml