An empirical Bayes test for allelic-imbalance detection in ChIP-seq. (3rd November 2017)
- Record Type:
- Journal Article
- Title:
- An empirical Bayes test for allelic-imbalance detection in ChIP-seq. (3rd November 2017)
- Main Title:
- An empirical Bayes test for allelic-imbalance detection in ChIP-seq
- Authors:
- Zhang, Qi
Keleş, Sündüz - Abstract:
- SUMMARY: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) has enabled discovery of genomic regions enriched with biological signals such as transcription factor binding and histone modifications. Allelic-imbalance (ALI) detection is a complementary analysis of ChIP-seq data for associating biological signals with single nucleotide polymorphisms (SNPs). It has been successfully used in elucidating functional roles of non-coding SNPs. Commonly used statistical approaches for ALI detection are often based on binomial testing and mixture models, both of which rely on strong assumptions on the distribution of the unobserved allelic probability, and have significant practical shortcomings. We propose Non-Parametric Binomial (NPBin) test for ALI detection and for modeling Binomial data in general. NPBin models the density of the unobserved allelic probability non-parametrically, and estimates its empirical null distribution via curve fitting. We demonstrate the advantages of NPBin in terms of interpretability of the estimated density and the accuracy in ALI detection using simulations and analysis of several ChIP-seq data sets. We also illustrate the generality of our modeling framework beyond ALI detection by an application to a baseball batting average prediction problem. This article has supplementary material available at Biostatistics online. The code and the sample input data have been also deposited to githubSUMMARY: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) has enabled discovery of genomic regions enriched with biological signals such as transcription factor binding and histone modifications. Allelic-imbalance (ALI) detection is a complementary analysis of ChIP-seq data for associating biological signals with single nucleotide polymorphisms (SNPs). It has been successfully used in elucidating functional roles of non-coding SNPs. Commonly used statistical approaches for ALI detection are often based on binomial testing and mixture models, both of which rely on strong assumptions on the distribution of the unobserved allelic probability, and have significant practical shortcomings. We propose Non-Parametric Binomial (NPBin) test for ALI detection and for modeling Binomial data in general. NPBin models the density of the unobserved allelic probability non-parametrically, and estimates its empirical null distribution via curve fitting. We demonstrate the advantages of NPBin in terms of interpretability of the estimated density and the accuracy in ALI detection using simulations and analysis of several ChIP-seq data sets. We also illustrate the generality of our modeling framework beyond ALI detection by an application to a baseball batting average prediction problem. This article has supplementary material available at Biostatistics online. The code and the sample input data have been also deposited to github https://github.com/QiZhangStat/ALIdetection . … (more)
- Is Part Of:
- Biostatistics. Volume 19:Number 4(2018)
- Journal:
- Biostatistics
- Issue:
- Volume 19:Number 4(2018)
- Issue Display:
- Volume 19, Issue 4 (2018)
- Year:
- 2018
- Volume:
- 19
- Issue:
- 4
- Issue Sort Value:
- 2018-0019-0004-0000
- Page Start:
- 546
- Page End:
- 561
- Publication Date:
- 2017-11-03
- Subjects:
- Allelic-imbalance -- ChIP-seq -- Empirical Bayes -- Non-parametric density estimation -- Spline
Medical statistics -- Periodicals
Biometry -- Periodicals
Health risk assessment -- Periodicals
Medicine -- Research -- Statistical methods -- Periodicals
610.727 - Journal URLs:
- http://www3.oup.co.uk/biosts ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/biostatistics/kxx060 ↗
- Languages:
- English
- ISSNs:
- 1465-4644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2089.628000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12211.xml