Improving the value of public RNA-seq expression data by phenotype prediction. Issue 9 (5th March 2018)
- Record Type:
- Journal Article
- Title:
- Improving the value of public RNA-seq expression data by phenotype prediction. Issue 9 (5th March 2018)
- Main Title:
- Improving the value of public RNA-seq expression data by phenotype prediction
- Authors:
- Ellis, Shannon E
Collado-Torres, Leonardo
Jaffe, Andrew
Leek, Jeffrey T - Abstract:
- Abstract: Publicly available genomic data are a valuable resource for studying normal human variation and disease, but these data are often not well labeled or annotated. The lack of phenotype information for public genomic data severely limits their utility for addressing targeted biological questions. We develop an in silico phenotyping approach for predicting critical missing annotation directly from genomic measurements using well-annotated genomic and phenotypic data produced by consortia like TCGA and GTEx as training data. We apply in silico phenotyping to a set of 70 000 RNA-seq samples we recently processed on a common pipeline as part of the recount2 project. We use gene expression data to build and evaluate predictors for both biological phenotypes (sex, tissue, sample source) and experimental conditions (sequencing strategy). We demonstrate how these predictions can be used to study cross-sample properties of public genomic data, select genomic projects with specific characteristics, and perform downstream analyses using predicted phenotypes. The methods to perform phenotype prediction are available in the phenopredict R package and the predictions for recount2 are available from the recount R package. With data and phenotype information available for 70, 000 human samples, expression data is available for use on a scale that was not previously feasible.
- Is Part Of:
- Nucleic acids research. Volume 46:Issue 9(2018)
- Journal:
- Nucleic acids research
- Issue:
- Volume 46:Issue 9(2018)
- Issue Display:
- Volume 46, Issue 9 (2018)
- Year:
- 2018
- Volume:
- 46
- Issue:
- 9
- Issue Sort Value:
- 2018-0046-0009-0000
- Page Start:
- e54
- Page End:
- e54
- Publication Date:
- 2018-03-05
- Subjects:
- Nucleic acids -- Periodicals
Molecular biology -- Periodicals
572.805 - Journal URLs:
- http://nar.oxfordjournals.org/ ↗
http://www.ncbi.nlm.nih.gov/pmc/journals/4 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/nar/gky102 ↗
- Languages:
- English
- ISSNs:
- 0305-1048
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6183.850000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12214.xml