Fully Bayesian Analysis of RNA-seq Counts for the Detection of Gene Expression Heterosis. Issue 526 (3rd April 2019)
- Record Type:
- Journal Article
- Title:
- Fully Bayesian Analysis of RNA-seq Counts for the Detection of Gene Expression Heterosis. Issue 526 (3rd April 2019)
- Main Title:
- Fully Bayesian Analysis of RNA-seq Counts for the Detection of Gene Expression Heterosis
- Authors:
- Landau, Will
Niemi, Jarad
Nettleton, Dan - Abstract:
- ABSTRACT: Heterosis, or hybrid vigor, is the enhancement of the phenotype of hybrid progeny relative to their inbred parents. Heterosis is extensively used in agriculture, and the underlying mechanisms are unclear. To investigate the molecular basis of phenotypic heterosis, researchers search tens of thousands of genes for heterosis with respect to expression in the transcriptome. Difficulty arises in the assessment of heterosis due to composite null hypotheses and nonuniform distributions for p -values under these null hypotheses. Thus, we develop a general hierarchical model for count data and a fully Bayesian analysis in which an efficient parallelized Markov chain Monte Carlo algorithm ameliorates the computational burden. We use our method to detect gene expression heterosis in a two-hybrid plant-breeding scenario, both in a real RNA-seq maize dataset and in simulation studies. In the simulation studies, we show our method has well-calibrated posterior probabilities and credible intervals when the model assumed in analysis matches the model used to simulate the data. Although model misspecification can adversely affect calibration, the methodology is still able to accurately rank genes. Finally, we show that hyperparameter posteriors are extremely narrow and an empirical Bayes (eBayes) approach based on posterior means from the fully Bayesian analysis provides virtually equivalent posterior probabilities, credible intervals, and gene rankings relative to the fullyABSTRACT: Heterosis, or hybrid vigor, is the enhancement of the phenotype of hybrid progeny relative to their inbred parents. Heterosis is extensively used in agriculture, and the underlying mechanisms are unclear. To investigate the molecular basis of phenotypic heterosis, researchers search tens of thousands of genes for heterosis with respect to expression in the transcriptome. Difficulty arises in the assessment of heterosis due to composite null hypotheses and nonuniform distributions for p -values under these null hypotheses. Thus, we develop a general hierarchical model for count data and a fully Bayesian analysis in which an efficient parallelized Markov chain Monte Carlo algorithm ameliorates the computational burden. We use our method to detect gene expression heterosis in a two-hybrid plant-breeding scenario, both in a real RNA-seq maize dataset and in simulation studies. In the simulation studies, we show our method has well-calibrated posterior probabilities and credible intervals when the model assumed in analysis matches the model used to simulate the data. Although model misspecification can adversely affect calibration, the methodology is still able to accurately rank genes. Finally, we show that hyperparameter posteriors are extremely narrow and an empirical Bayes (eBayes) approach based on posterior means from the fully Bayesian analysis provides virtually equivalent posterior probabilities, credible intervals, and gene rankings relative to the fully Bayesian solution. This evidence of equivalence provides support for the use of eBayes procedures in RNA-seq data analysis if accurate hyperparameter estimates can be obtained. Supplementary materials for this article are available online. … (more)
- Is Part Of:
- Journal of the American Statistical Association. Volume 114:Issue 526(2019)
- Journal:
- Journal of the American Statistical Association
- Issue:
- Volume 114:Issue 526(2019)
- Issue Display:
- Volume 114, Issue 526 (2019)
- Year:
- 2019
- Volume:
- 114
- Issue:
- 526
- Issue Sort Value:
- 2019-0114-0526-0000
- Page Start:
- 610
- Page End:
- 621
- Publication Date:
- 2019-04-03
- Subjects:
- CUDA -- Empirical Bayes -- Graphics processing unit -- Hierarchical model -- Hybrid vigor -- Negative-binomial
Statistics -- Periodicals
Statistics -- Periodicals
Statistiques -- Périodiques
États-Unis -- Statistiques -- Périodiques
519.5 - Journal URLs:
- http://www.jstor.org/journals/01621459.html ↗
http://www.ingentaconnect.com/content/asa/jasa ↗
http://www.tandfonline.com/loi/uasa20 ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/01621459.2018.1497496 ↗
- Languages:
- English
- ISSNs:
- 0162-1459
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4694.000000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11175.xml