Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq. Issue 14 (27th July 2021)
- Record Type:
- Journal Article
- Title:
- Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq. Issue 14 (27th July 2021)
- Main Title:
- Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq
- Authors:
- Massarat, Arya R
Sen, Arko
Jaureguy, Jeff
Tyndale, Sélène T
Fu, Yi
Erikson, Galina
McVicker, Graham
- Abstract:
- Genetic variants and de novo mutations in regulatory regions of the genome are typically discovered by whole-genome sequencing (WGS), however WGS is expensive and most WGS reads come from non-regulatory regions. The Assay for Transposase-Accessible Chromatin (ATAC-seq) generates reads from regulatory sequences and could potentially be used as a low-cost 'capture' method for regulatory variant discovery, but its use for this purpose has not been systematically evaluated. Here we apply seven variant callers to bulk and single-cell ATAC-seq data and evaluate their ability to identify single nucleotide variants (SNVs) and insertions/deletions (indels). In addition, we develop an ensemble classifier, VarCA, which combines features from individual variant callers to predict variants. The Genome Analysis Toolkit (GATK) is the best-performing individual caller with precision/recall on a bulk ATAC test dataset of 0.92/0.97 for SNVs and 0.87/0.82 for indels within ATAC-seq peak regions with at least 10 reads. On bulk ATAC-seq reads, VarCA achieves superior performance with precision/recall of 0.99/0.95 for SNVs and 0.93/0.80 for indels. On single-cell ATAC-seq reads, VarCA attains precision/recall of 0.98/0.94 for SNVs and 0.82/0.82 for indels. In summary, ATAC-seq reads can be used to accurately discover non-coding regulatory variants in the absence of whole-genome sequencing data and our ensemble method, VarCA, has the best overall performance.
- Is Part Of:
- Nucleic acids research. Volume 49:Issue 14(2021)
- Journal:
- Nucleic acids research
- Issue:
- Volume 49:Issue 14(2021)
- Issue Display:
- Volume 49, Issue 14 (2021)
- Year:
- 2021
- Volume:
- 49
- Issue:
- 14
- Issue Sort Value:
- 2021-0049-0014-0000
- Page Start:
- 7986
- Page End:
- 7994
- Publication Date:
- 2021-07-27
- Subjects:
- Nucleic acids -- Periodicals
Molecular biology -- Periodicals
572.805
- Journal URLs:
- http://nar.oxfordjournals.org/
http://www.ncbi.nlm.nih.gov/pmc/journals/4
http://ukcatalogue.oup.com/
http://firstsearch.oclc.org
- DOI:
- 10.1093/nar/gkab621
- Languages:
- English
- ISSNs:
- 0305-1048
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 6183.850000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 18478.xml