A bioinformatics pipeline for estimating mitochondrial DNA copy number and heteroplasmy levels from whole genome sequencing data. Issue 2 (17th May 2022)
- Record Type:
- Journal Article
- Title:
- A bioinformatics pipeline for estimating mitochondrial DNA copy number and heteroplasmy levels from whole genome sequencing data. Issue 2 (17th May 2022)
- Main Title:
- A bioinformatics pipeline for estimating mitochondrial DNA copy number and heteroplasmy levels from whole genome sequencing data
- Authors:
- Battle, Stephanie L
Puiu, Daniela
Verlouw, Joost
Broer, Linda
Boerwinkle, Eric
Taylor, Kent D
Rotter, Jerome I
Rich, Stephan S
Grove, Megan L
Pankratz, Nathan
Fetterman, Jessica L
Liu, Chunyu
Arking, Dan E
- Abstract:
- Mitochondrial diseases are a heterogeneous group of disorders that can be caused by mutations in the nuclear or mitochondrial genome. Mitochondrial DNA (mtDNA) variants may exist in a state of heteroplasmy, where a percentage of DNA molecules harbor a variant, or homoplasmy, where all DNA molecules have the same variant. The relative quantity of mtDNA in a cell, or copy number (mtDNA-CN), is associated with mitochondrial function, human disease, and mortality. To facilitate accurate identification of heteroplasmy and quantify mtDNA-CN, we built a bioinformatics pipeline that takes whole genome sequencing data and outputs mitochondrial variants and mtDNA-CN. We incorporate variant annotations to facilitate determination of variant significance. Our pipeline yields uniform coverage by remapping to a circularized chrM and by recovering reads falsely mapped to nuclear-encoded mitochondrial sequences. Notably, we construct a consensus chrM sequence for each sample and recall heteroplasmy against the sample's unique mitochondrial genome. We observe an approximately 3-fold increased association with age for heteroplasmic variants in non-homopolymer regions and are better able to capture genetic variation in the D-loop of chrM compared to existing software. Compared with existing pipelines, our bioinformatics pipeline more accurately captures features of mitochondrial genetics that are important in understanding how mitochondrial dysfunction contributes to disease.
- Is Part Of:
- NAR genomics and bioinformatics. Volume 4:Issue 2(2022)
- Journal:
- NAR genomics and bioinformatics
- Issue:
- Volume 4:Issue 2(2022)
- Issue Display:
- Volume 4, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 4
- Issue:
- 2
- Issue Sort Value:
- 2022-0004-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-05-17
- Subjects:
- Genomics -- Periodicals
Bioinformatics -- Periodicals
572.8
- Journal URLs:
- http://www.oxfordjournals.org/
https://academic.oup.com/nargab
- DOI:
- 10.1093/nargab/lqac034
- Languages:
- English
- ISSNs:
- 2631-9268
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21565.xml