A Bayesian Model Selection Approach for Identifying Differentially Expressed Transcripts from RNA Sequencing Data. Issue 1 (7th February 2017)
- Record Type:
- Journal Article
- Title:
- A Bayesian Model Selection Approach for Identifying Differentially Expressed Transcripts from RNA Sequencing Data. Issue 1 (7th February 2017)
- Main Title:
- A Bayesian Model Selection Approach for Identifying Differentially Expressed Transcripts from RNA Sequencing Data
- Authors:
- Papastamoulis, Panagiotis
Rattray, Magnus - Abstract:
- Summary: Recent advances in molecular biology allow the quantification of the transcriptome and scoring transcripts as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available inference methods treat them separately: a primary model is used to estimate expression and its output is post processed by using a differential expression model. In the paper, both issues are simultaneously addressed by proposing the joint estimation of expression levels and differential expression: the unknown relative abundance of each transcript can either be equal or not between two conditions. A hierarchical Bayesian model builds on the BitSeq framework and the posterior distribution of transcript expression and differential expression is inferred by using Markov chain Monte Carlo sampling. It is shown that the model proposed enjoys conjugacy for fixed dimension variables; thus the full conditional distributions are analytically derived. Two samplers are constructed, a reversible jump Markov chain Monte Carlo sampler and a collapsed Gibbs sampler, and the latter is found to perform better. A cluster representation of the aligned reads to the transcriptome is introduced, allowing parallel estimation of the marginal posterior distribution of subsets of transcripts under reasonable computing time. Under a fixed prior probability of differential expression the clusterwise sampler has the same marginal posterior distributions asSummary: Recent advances in molecular biology allow the quantification of the transcriptome and scoring transcripts as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available inference methods treat them separately: a primary model is used to estimate expression and its output is post processed by using a differential expression model. In the paper, both issues are simultaneously addressed by proposing the joint estimation of expression levels and differential expression: the unknown relative abundance of each transcript can either be equal or not between two conditions. A hierarchical Bayesian model builds on the BitSeq framework and the posterior distribution of transcript expression and differential expression is inferred by using Markov chain Monte Carlo sampling. It is shown that the model proposed enjoys conjugacy for fixed dimension variables; thus the full conditional distributions are analytically derived. Two samplers are constructed, a reversible jump Markov chain Monte Carlo sampler and a collapsed Gibbs sampler, and the latter is found to perform better. A cluster representation of the aligned reads to the transcriptome is introduced, allowing parallel estimation of the marginal posterior distribution of subsets of transcripts under reasonable computing time. Under a fixed prior probability of differential expression the clusterwise sampler has the same marginal posterior distributions as the raw sampler, but a more general prior structure is also employed. The algorithm proposed is benchmarked against alternative methods by using synthetic data sets and applied to real RNA sequencing data. Source code is available on line from https://github.com/mqbssppe/cjBitSeq . … (more)
- Is Part Of:
- Journal of the Royal Statistical Society. Volume 67:Issue 1(2018:Jan.)
- Journal:
- Journal of the Royal Statistical Society
- Issue:
- Volume 67:Issue 1(2018:Jan.)
- Issue Display:
- Volume 67, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 67
- Issue:
- 1
- Issue Sort Value:
- 2018-0067-0001-0000
- Page Start:
- 3
- Page End:
- 23
- Publication Date:
- 2017-02-07
- Subjects:
- Collapsed Gibbs sampler -- Mixture models -- Reversible jump Markov chain Monte Carlo sampling -- Ribonucleic acid sequencing
Statistics -- Periodicals
519.5 - Journal URLs:
- http://rss.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)1467-9876/ ↗
https://academic.oup.com/jrsssc ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/rssc.12213 ↗
- Languages:
- English
- ISSNs:
- 0035-9254
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1580.000000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 26100.xml