Benchmarking association analyses of continuous exposures with RNA-seq in observational studies. Issue 6 (20th May 2021)
- Record Type:
- Journal Article
- Title:
- Benchmarking association analyses of continuous exposures with RNA-seq in observational studies. Issue 6 (20th May 2021)
- Main Title:
- Benchmarking association analyses of continuous exposures with RNA-seq in observational studies
- Authors:
- Sofer, Tamar
Kurniansyah, Nuzulul
Aguet, François
Ardlie, Kristin
Durda, Peter
Nickerson, Deborah A
Smith, Joshua D
Liu, Yongmei
Gharib, Sina A
Redline, Susan
Rich, Stephen S
Rotter, Jerome I
Taylor, Kent D - Abstract:
- Abstract: Large datasets of hundreds to thousands of individuals measuring RNA-seq in observational studies are becoming available. Many popular software packages for analysis of RNA-seq data were constructed to study differences in expression signatures in an experimental design with well-defined conditions (exposures). In contrast, observational studies may have varying levels of confounding transcript-exposure associations; further, exposure measures may vary from discrete (exposed, yes/no) to continuous (levels of exposure), with non-normal distributions of exposure. We compare popular software for gene expression—DESeq2, edgeR and limma—as well as linear regression-based analyses for studying the association of continuous exposures with RNA-seq. We developed a computation pipeline that includes transformation, filtering and generation of empirical null distribution of association P -values, and we apply the pipeline to compute empirical P -values with multiple testing correction. We employ a resampling approach that allows for assessment of false positive detection across methods, power comparison and the computation of quantile empirical P -values. The results suggest that linear regression methods are substantially faster with better control of false detections than other methods, even with the resampling method to compute empirical P -values. We provide the proposed pipeline with fast algorithms in an R package Olivia, and implemented it to study the associations ofAbstract: Large datasets of hundreds to thousands of individuals measuring RNA-seq in observational studies are becoming available. Many popular software packages for analysis of RNA-seq data were constructed to study differences in expression signatures in an experimental design with well-defined conditions (exposures). In contrast, observational studies may have varying levels of confounding transcript-exposure associations; further, exposure measures may vary from discrete (exposed, yes/no) to continuous (levels of exposure), with non-normal distributions of exposure. We compare popular software for gene expression—DESeq2, edgeR and limma—as well as linear regression-based analyses for studying the association of continuous exposures with RNA-seq. We developed a computation pipeline that includes transformation, filtering and generation of empirical null distribution of association P -values, and we apply the pipeline to compute empirical P -values with multiple testing correction. We employ a resampling approach that allows for assessment of false positive detection across methods, power comparison and the computation of quantile empirical P -values. The results suggest that linear regression methods are substantially faster with better control of false detections than other methods, even with the resampling method to compute empirical P -values. We provide the proposed pipeline with fast algorithms in an R package Olivia, and implemented it to study the associations of measures of sleep disordered breathing with RNA-seq in peripheral blood mononuclear cells in participants from the Multi-Ethnic Study of Atherosclerosis. … (more)
- Is Part Of:
- Briefings in bioinformatics. Volume 22:Issue 6(2021)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 22:Issue 6(2021)
- Issue Display:
- Volume 22, Issue 6 (2021)
- Year:
- 2021
- Volume:
- 22
- Issue:
- 6
- Issue Sort Value:
- 2021-0022-0006-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-05-20
- Subjects:
- observational studies -- continuous exposure -- non-normality -- RNA-seq -- empirical P-values
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbab194 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19693.xml