Development of the Wheat Practical Haplotype Graph database as a resource for genotyping data storage and genotype imputation. Issue 2 (9th November 2021)
- Record Type:
- Journal Article
- Title:
- Development of the Wheat Practical Haplotype Graph database as a resource for genotyping data storage and genotype imputation. Issue 2 (9th November 2021)
- Main Title:
- Development of the Wheat Practical Haplotype Graph database as a resource for genotyping data storage and genotype imputation
- Authors:
- Jordan, Katherine W
Bradbury, Peter J
Miller, Zachary R
Nyine, Moses
He, Fei
Fraser, Max
Anderson, Jim
Mason, Esten
Katz, Andrew
Pearce, Stephen
Carter, Arron H
Prather, Samuel
Pumphrey, Michael
Chen, Jianli
Cook, Jason
Liu, Shuyu
Rudd, Jackie C
Wang, Zhen
Chu, Chenggen
Ibrahim, Amir M H
Turkus, Jonathan
Olson, Eric
Nagarajan, Ragupathi
Carver, Brett
Yan, Liuling
Taagen, Ellie
Sorrells, Mark
Ward, Brian
Ren, Jie
Akhunova, Alina
Bai, Guihua
Bowden, Robert
Fiedler, Jason
Faris, Justin
Dubcovsky, Jorge
Guttieri, Mary
Brown-Guedira, Gina
Buckler, Ed
Jannink, Jean-Luc
Akhunov, Eduard D
… (more) - Editors:
- de Koning, D -J
- Abstract:
- Abstract: To improve the efficiency of high-density genotype data storage and imputation in bread wheat ( Triticum aestivum L.), we applied the Practical Haplotype Graph (PHG) tool. The Wheat PHG database was built using whole-exome capture sequencing data from a diverse set of 65 wheat accessions. Population haplotypes were inferred for the reference genome intervals defined by the boundaries of the high-quality gene models. Missing genotypes in the inference panels, composed of wheat cultivars or recombinant inbred lines genotyped by exome capture, genotyping-by-sequencing (GBS), or whole-genome skim-seq sequencing approaches, were imputed using the Wheat PHG database. Though imputation accuracy varied depending on the method of sequencing and coverage depth, we found 92% imputation accuracy with 0.01× sequence coverage, which was slightly lower than the accuracy obtained using the 0.5× sequence coverage (96.6%). Compared to Beagle, on average, PHG imputation was ∼3.5% ( P -value < 2 × 10 −14 ) more accurate, and showed 27% higher accuracy at imputing a rare haplotype introgressed from a wild relative into wheat. We found reduced accuracy of imputation with independent 2× GBS data (88.6%), which increases to 89.2% with the inclusion of parental haplotypes in the database. The accuracy reduction with GBS is likely associated with the small overlap between GBS markers and the exome capture dataset, which was used for constructing PHG. The highest imputation accuracy wasAbstract: To improve the efficiency of high-density genotype data storage and imputation in bread wheat ( Triticum aestivum L.), we applied the Practical Haplotype Graph (PHG) tool. The Wheat PHG database was built using whole-exome capture sequencing data from a diverse set of 65 wheat accessions. Population haplotypes were inferred for the reference genome intervals defined by the boundaries of the high-quality gene models. Missing genotypes in the inference panels, composed of wheat cultivars or recombinant inbred lines genotyped by exome capture, genotyping-by-sequencing (GBS), or whole-genome skim-seq sequencing approaches, were imputed using the Wheat PHG database. Though imputation accuracy varied depending on the method of sequencing and coverage depth, we found 92% imputation accuracy with 0.01× sequence coverage, which was slightly lower than the accuracy obtained using the 0.5× sequence coverage (96.6%). Compared to Beagle, on average, PHG imputation was ∼3.5% ( P -value < 2 × 10 −14 ) more accurate, and showed 27% higher accuracy at imputing a rare haplotype introgressed from a wild relative into wheat. We found reduced accuracy of imputation with independent 2× GBS data (88.6%), which increases to 89.2% with the inclusion of parental haplotypes in the database. The accuracy reduction with GBS is likely associated with the small overlap between GBS markers and the exome capture dataset, which was used for constructing PHG. The highest imputation accuracy was obtained with exome capture for the wheat D genome, which also showed the highest levels of linkage disequilibrium and proportion of identity-by-descent regions among accessions in the PHG database. We demonstrate that genetic mapping based on genotypes imputed using PHG identifies SNPs with a broader range of effect sizes that together explain a higher proportion of genetic variance for heading date and meiotic crossover rate compared to previous studies. … (more)
- Is Part Of:
- G3. Volume 12:Issue 2(2022)
- Journal:
- G3
- Issue:
- Volume 12:Issue 2(2022)
- Issue Display:
- Volume 12, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 12
- Issue:
- 2
- Issue Sort Value:
- 2022-0012-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-11-09
- Subjects:
- wheat -- genotype imputation -- Practical Haplotype Graph -- skim-seq -- exome capture
Genetics -- Research -- Periodicals
Genomics -- Periodicals
Genetics
Genomics
Genes
Genetics -- Research
Genomics
Electronic journals
Periodical
Periodicals
Fulltext
Internet Resources
Periodicals
572.8 - Journal URLs:
- https://academic.oup.com/g3journal ↗
http://bibpurl.oclc.org/web/43467 ↗
http://www.g3journal.org ↗
http://www.oxfordjournals.org/ ↗ - DOI:
- 10.1093/g3journal/jkab390 ↗
- Languages:
- English
- ISSNs:
- 2160-1836
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25872.xml