Estimation of population allele frequencies from next‐generation sequencing data: pool‐versus individual‐based genotyping. Issue 14 (4th June 2013)
- Record Type:
- Journal Article
- Title:
- Estimation of population allele frequencies from next‐generation sequencing data: pool‐versus individual‐based genotyping. Issue 14 (4th June 2013)
- Main Title:
- Estimation of population allele frequencies from next‐generation sequencing data: pool‐versus individual‐based genotyping
- Authors:
- Gautier, Mathieu
Foucaud, Julien
Gharbi, Karim
Cézard, Timothée
Galan, Maxime
Loiseau, Anne
Thomson, Marian
Pudlo, Pierre
Kerdelhué, Carole
Estoup, Arnaud
- Abstract:
- Molecular markers produced by next‐generation sequencing (NGS) technologies are revolutionizing genetic research. However, the costs of analysing large numbers of individual genomes remain prohibitive for most population genetics studies. Here, we present results based on mathematical derivations showing that, under many realistic experimental designs, NGS of DNA pools from diploid individuals allows to estimate the allele frequencies at single nucleotide polymorphisms (SNPs) with at least the same accuracy as individual‐based analyses, for considerably lower library construction and sequencing efforts. These findings remain true when taking into account the possibility of substantially unequal contributions of each individual to the final pool of sequence reads. We propose the intuitive notion of effective pool size to account for unequal pooling and derive a Bayesian hierarchical model to estimate this parameter directly from the data. We provide a user‐friendly application assessing the accuracy of allele frequency estimation from both pool‐ and individual‐based NGS population data under various sampling, sequencing depth and experimental error designs. We illustrate our findings with theoretical examples and real data sets corresponding to SNP loci obtained using restriction site–associated DNA (RAD) sequencing in pool‐ and individual‐based experiments carried out on the same population of the pine processionary moth (Thaumetopoea pityocampa). NGS of DNA pools might not be optimal for all types of studies but provides a cost‐effective approach for estimating allele frequencies for very large numbers of SNPs. It thus allows comparison of genome‐wide patterns of genetic variation for large numbers of individuals in multiple populations.
- Is Part Of:
- Molecular ecology. Volume 22:Issue 14(2013)
- Journal:
- Molecular ecology
- Issue:
- Volume 22:Issue 14(2013)
- Issue Display:
- Volume 22, Issue 14 (2013)
- Year:
- 2013
- Volume:
- 22
- Issue:
- 14
- Issue Sort Value:
- 2013-0022-0014-0000
- Page Start:
- 3766
- Page End:
- 3779
- Publication Date:
- 2013-06-04
- Subjects:
- Molecular ecology -- Periodicals
Molecular population biology -- Periodicals
576
- Journal URLs:
- http://www.blackwell-synergy.com/servlet/useragent?func=showIssues&code=mec&close=1999#C1999
http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1365-294X
http://onlinelibrary.wiley.com/
- DOI:
- 10.1111/mec.12360
- Languages:
- English
- ISSNs:
- 0962-1083
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 5900.817360
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 4134.xml