Accounting for missing data in the estimation of contemporary genetic effective population size (Ne). (29th December 2012)
- Record Type:
- Journal Article
- Title:
- Accounting for missing data in the estimation of contemporary genetic effective population size (Ne). (29th December 2012)
- Main Title:
- Accounting for missing data in the estimation of contemporary genetic effective population size (Ne)
- Authors:
- Peel, D.
Waples, R. S.
Macbeth, G. M.
Do, C.
Ovenden, J. R. - Abstract:
- <abstract abstract-type="main" id="men12049-abs-0001"> <title>Abstract</title> <p>Theoretical models are often applied to population genetic data sets without fully considering the effect of missing data. Researchers can deal with missing data by removing individuals that have failed to yield genotypes and/or by removing loci that have failed to yield allelic determinations, but despite their best efforts, most data sets still contain some missing data. As a consequence, realized sample size differs among loci, and this poses a problem for unbiased methods that must explicitly account for random sampling error. One commonly used solution for the calculation of contemporary effective population size (<italic>N</italic><sub>e</sub>) is to calculate the effective sample size as an unweighted mean or harmonic mean across loci. This is not ideal because it fails to account for the fact that loci with different numbers of alleles have different information content. Here we consider this problem for genetic estimators of contemporary effective population size (<italic>N</italic><sub>e</sub>). To evaluate bias and precision of several statistical approaches for dealing with missing data, we simulated populations with known <italic>N</italic><sub>e</sub> and various degrees of missing data. Across all scenarios, one method of correcting for missing data (fixed‐inverse variance‐weighted harmonic mean) consistently performed the best for both single‐sample and two‐sample (temporal)<abstract abstract-type="main" id="men12049-abs-0001"> <title>Abstract</title> <p>Theoretical models are often applied to population genetic data sets without fully considering the effect of missing data. Researchers can deal with missing data by removing individuals that have failed to yield genotypes and/or by removing loci that have failed to yield allelic determinations, but despite their best efforts, most data sets still contain some missing data. As a consequence, realized sample size differs among loci, and this poses a problem for unbiased methods that must explicitly account for random sampling error. One commonly used solution for the calculation of contemporary effective population size (<italic>N</italic><sub>e</sub>) is to calculate the effective sample size as an unweighted mean or harmonic mean across loci. This is not ideal because it fails to account for the fact that loci with different numbers of alleles have different information content. Here we consider this problem for genetic estimators of contemporary effective population size (<italic>N</italic><sub>e</sub>). To evaluate bias and precision of several statistical approaches for dealing with missing data, we simulated populations with known <italic>N</italic><sub>e</sub> and various degrees of missing data. Across all scenarios, one method of correcting for missing data (fixed‐inverse variance‐weighted harmonic mean) consistently performed the best for both single‐sample and two‐sample (temporal) methods of estimating <italic>N</italic><sub>e</sub> and outperformed some methods currently in widespread use. The approach adopted here may be a starting point to adjust other population genetics methods that include per‐locus sample size components.</p> </abstract> … (more)
- Is Part Of:
- Molecular ecology resources. Volume 13:Number 2(2013:Mar.)
- Journal:
- Molecular ecology resources
- Issue:
- Volume 13:Number 2(2013:Mar.)
- Issue Display:
- Volume 13, Issue 2 (2013)
- Year:
- 2013
- Volume:
- 13
- Issue:
- 2
- Issue Sort Value:
- 2013-0013-0002-0000
- Page Start:
- 243
- Page End:
- 253
- Publication Date:
- 2012-12-29
- Subjects:
- Molecular ecology -- Periodicals
572.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1755-0998 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/1755-0998.12049 ↗
- Languages:
- English
- ISSNs:
- 1755-098X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.817368
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 3859.xml