Genotyping Polyploids from Messy Sequencing Data. Issue 3 (5th September 2018)
- Record Type:
- Journal Article
- Title:
- Genotyping Polyploids from Messy Sequencing Data. Issue 3 (5th September 2018)
- Main Title:
- Genotyping Polyploids from Messy Sequencing Data
- Authors:
- Gerard, David
Ferrão, Luis Felipe Ventorim
Garcia, Antonio Augusto Franco
Stephens, Matthew - Abstract:
- Abstract: Gerard et al. highlight several issues encountered when genotyping polyploid organisms from next-generation sequencing data, including allelic bias, overdispersion, and outlying observations. They present modeling solutions and software to account for these issues... Detecting and quantifying the differences in individual genomes ( i.e., genotyping), plays a fundamental role in most modern bioinformatics pipelines. Many scientists now use reduced representation next-generation sequencing (NGS) approaches for genotyping. Genotyping diploid individuals using NGS is a well-studied field, and similar methods for polyploid individuals are just emerging. However, there are many aspects of NGS data, particularly in polyploids, that remain unexplored by most methods. Our contributions in this paper are fourfold: (i) We draw attention to, and then model, common aspects of NGS data: sequencing error, allelic bias, overdispersion, and outlying observations. (ii) Many datasets feature related individuals, and so we use the structure of Mendelian segregation to build an empirical Bayes approach for genotyping polyploid individuals. (iii) We develop novel models to account for preferential pairing of chromosomes, and harness these for genotyping. (iv) We derive oracle genotyping error rates that may be used for read depth suggestions. We assess the accuracy of our method in simulations, and apply it to a dataset of hexaploid sweet potato ( Ipomoea batatas ). An R packageAbstract: Gerard et al. highlight several issues encountered when genotyping polyploid organisms from next-generation sequencing data, including allelic bias, overdispersion, and outlying observations. They present modeling solutions and software to account for these issues... Detecting and quantifying the differences in individual genomes ( i.e., genotyping), plays a fundamental role in most modern bioinformatics pipelines. Many scientists now use reduced representation next-generation sequencing (NGS) approaches for genotyping. Genotyping diploid individuals using NGS is a well-studied field, and similar methods for polyploid individuals are just emerging. However, there are many aspects of NGS data, particularly in polyploids, that remain unexplored by most methods. Our contributions in this paper are fourfold: (i) We draw attention to, and then model, common aspects of NGS data: sequencing error, allelic bias, overdispersion, and outlying observations. (ii) Many datasets feature related individuals, and so we use the structure of Mendelian segregation to build an empirical Bayes approach for genotyping polyploid individuals. (iii) We develop novel models to account for preferential pairing of chromosomes, and harness these for genotyping. (iv) We derive oracle genotyping error rates that may be used for read depth suggestions. We assess the accuracy of our method in simulations, and apply it to a dataset of hexaploid sweet potato ( Ipomoea batatas ). An R package implementing our method is available at https://cran.r-project.org/package=updog . … (more)
- Is Part Of:
- Genetics. Volume 210:Issue 3(2018)
- Journal:
- Genetics
- Issue:
- Volume 210:Issue 3(2018)
- Issue Display:
- Volume 210, Issue 3 (2018)
- Year:
- 2018
- Volume:
- 210
- Issue:
- 3
- Issue Sort Value:
- 2018-0210-0003-0000
- Page Start:
- 789
- Page End:
- 807
- Publication Date:
- 2018-09-05
- Subjects:
- GBS -- RAD-Seq -- sequencing -- hierarchical modeling -- read-mapping bias
Genetics -- Periodicals
576.5 - Journal URLs:
- http://www.oxfordjournals.org/ ↗
- DOI:
- 10.1534/genetics.118.301468 ↗
- Languages:
- English
- ISSNs:
- 0016-6731
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25313.xml