A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. Issue 1 (December 2017)
- Record Type:
- Journal Article
- Title:
- A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. Issue 1 (December 2017)
- Main Title:
- A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies
- Authors:
- Teschendorff, Andrew
Breeze, Charles
Zheng, Shijie
Beck, Stephan
- Abstract:
- Background: Intra-sample cellular heterogeneity presents numerous challenges to the identification of biomarkers in large Epigenome-Wide Association Studies (EWAS). While a number of reference-based deconvolution algorithms have emerged, their potential remains underexplored and a comparative evaluation of these algorithms beyond tissues such as blood is still lacking.
Results: Here we present a novel framework for reference-based inference, which leverages cell-type specific DNAse Hypersensitive Site (DHS) information from the NIH Epigenomics Roadmap to construct an improved reference DNA methylation database. We show that this leads to a marginal but statistically significant improvement of cell-count estimates in whole blood as well as in mixtures involving epithelial cell-types. Using this framework we compare a widely used state-of-the-art reference-based algorithm (called constrained projection) to two non-constrained approaches including CIBERSORT and a method based on robust partial correlations. We conclude that the widely-used constrained projection technique may not always be optimal. Instead, we find that the method based on robust partial correlations is generally more robust across a range of different tissue types and for realistic noise levels. We call the combined algorithm which uses DHS data and robust partial correlations for inference, EpiDISH (Epigenetic Dissection of Intra-Sample Heterogeneity). Finally, we demonstrate the added value of EpiDISH in an EWAS of smoking.
Conclusions: Estimating cell-type fractions and subsequent inference in EWAS may benefit from the use of non-constrained reference-based cell-type deconvolution methods.
- Is Part Of:
- BMC bioinformatics. Volume 18:Issue 1(2017)
- Journal:
- BMC bioinformatics
- Issue:
- Volume 18:Issue 1(2017)
- Issue Display:
- Volume 18, Issue 1 (2017)
- Year:
- 2017
- Volume:
- 18
- Issue:
- 1
- Issue Sort Value:
- 2017-0018-0001-0000
- Page Start:
- 1
- Page End:
- 14
- Publication Date:
- 2017-12
- Subjects:
- Cellular heterogeneity -- DNA methylation -- EWAS
Bioinformatics -- Periodicals
Computational biology -- Periodicals
570.285
- Journal URLs:
- http://www.biomedcentral.com/bmcbioinformatics/
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=13
http://link.springer.com/
- DOI:
- 10.1186/s12859-017-1511-5
- Languages:
- English
- ISSNs:
- 1471-2105
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - Digital store
British Library HMNTS - ELD Digital store
- Ingest File:
- 10027.xml