Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis. (14th October 2016)
- Record Type:
- Journal Article
- Title:
- Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis. (14th October 2016)
- Main Title:
- Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis
- Authors:
- Rigaill, Guillem
Balzergue, Sandrine
Brunaud, Véronique
Blondet, Eddy
Rau, Andrea
Rogier, Odile
Caius, José
Maugis-Rabusseau, Cathy
Soubigou-Taconnat, Ludivine
Aubourg, Sébastien
Lurin, Claire
Martin-Magniette, Marie-Laure
Delannoy, Etienne - Abstract:
- Abstract: Numerous statistical pipelines are now available for the differential analysis of gene expression measured with RNA-sequencing technology. Most of them are based on similar statistical frameworks after normalization, differing primarily in the choice of data distribution, mean and variance estimation strategy and data filtering. We propose an evaluation of the impact of these choices when few biological replicates are available through the use of synthetic data sets. This framework is based on real data sets and allows the exploration of various scenarios differing in the proportion of non-differentially expressed genes. Hence, it provides an evaluation of the key ingredients of the differential analysis, free of the biases associated with the simulation of data using parametric models. Our results show the relevance of a proper modeling of the mean by using linear or generalized linear modeling. Once the mean is properly modeled, the impact of the other parameters on the performance of the test is much less important. Finally, we propose to use the simple visualization of the raw P -value histogram as a practical evaluation criterion of the performance of differential analysis methods on real data sets.
- Is Part Of:
- Briefings in bioinformatics. Volume 19:Number 1(2018:Jan.)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 19:Number 1(2018:Jan.)
- Issue Display:
- Volume 19, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 19
- Issue:
- 1
- Issue Sort Value:
- 2018-0019-0001-0000
- Page Start:
- 65
- Page End:
- 76
- Publication Date:
- 2016-10-14
- Subjects:
- RNA-seq -- differential analysis -- benchmark data set
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbw092 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12362.xml