An approach for normalization and quality control for NanoString RNA expression data. Issue 3 (13th August 2020)
- Record Type:
- Journal Article
- Title:
- An approach for normalization and quality control for NanoString RNA expression data. Issue 3 (13th August 2020)
- Main Title:
- An approach for normalization and quality control for NanoString RNA expression data
- Authors:
- Bhattacharya, Arjun
Hamilton, Alina M
Furberg, Helena
Pietzak, Eugene
Purdue, Mark P
Troester, Melissa A
Hoadley, Katherine A
Love, Michael I - Abstract:
- Abstract: The NanoString RNA counting assay for formalin-fixed paraffin embedded samples is unique in its sensitivity, technical reproducibility and robustness for analysis of clinical and archival samples. While commercial normalization methods are provided by NanoString, they are not optimal for all settings, particularly when samples exhibit strong technical or biological variation or where housekeeping genes have variable performance across the cohort. Here, we develop and evaluate a more comprehensive normalization procedure for NanoString data with steps for quality control, selection of housekeeping targets, normalization and iterative data visualization and biological validation. The approach was evaluated using a large cohort ($N=\kern0.5em 1649$ ) from the Carolina Breast Cancer Study, two cohorts of moderate sample size ($N=359$ and$130$ ) and a small published dataset ($N=12$ ). The iterative process developed here eliminates technical variation (e.g. from different study phases or sites) more reliably than the three other methods, including NanoString's commercial package, without diminishing biological variation, especially in long-term longitudinal multiphase or multisite cohorts. We also find that probe sets validated for nCounter, such as the PAM50 gene signature, are impervious to batch issues. This work emphasizes that systematic quality control, normalization and visualization of NanoString nCounter data are an imperative component of study design thatAbstract: The NanoString RNA counting assay for formalin-fixed paraffin embedded samples is unique in its sensitivity, technical reproducibility and robustness for analysis of clinical and archival samples. While commercial normalization methods are provided by NanoString, they are not optimal for all settings, particularly when samples exhibit strong technical or biological variation or where housekeeping genes have variable performance across the cohort. Here, we develop and evaluate a more comprehensive normalization procedure for NanoString data with steps for quality control, selection of housekeeping targets, normalization and iterative data visualization and biological validation. The approach was evaluated using a large cohort ($N=\kern0.5em 1649$ ) from the Carolina Breast Cancer Study, two cohorts of moderate sample size ($N=359$ and$130$ ) and a small published dataset ($N=12$ ). The iterative process developed here eliminates technical variation (e.g. from different study phases or sites) more reliably than the three other methods, including NanoString's commercial package, without diminishing biological variation, especially in long-term longitudinal multiphase or multisite cohorts. We also find that probe sets validated for nCounter, such as the PAM50 gene signature, are impervious to batch issues. This work emphasizes that systematic quality control, normalization and visualization of NanoString nCounter data are an imperative component of study design that influences results in downstream analyses. … (more)
- Is Part Of:
- Briefings in bioinformatics. Volume 22:Issue 3(2021)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 22:Issue 3(2021)
- Issue Display:
- Volume 22, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 22
- Issue:
- 3
- Issue Sort Value:
- 2021-0022-0003-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08-13
- Subjects:
- NanoString nCounter expression -- gene expression normalization -- quality control -- data visualization
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbaa163 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24959.xml