Tailored Selection of Study Individuals to be Sequenced in Order to Improve the Accuracy of Genotype Imputation. Issue 2 (23rd December 2014)
- Record Type:
- Journal Article
- Title:
- Tailored Selection of Study Individuals to be Sequenced in Order to Improve the Accuracy of Genotype Imputation. Issue 2 (23rd December 2014)
- Main Title:
- Tailored Selection of Study Individuals to be Sequenced in Order to Improve the Accuracy of Genotype Imputation
- Authors:
- Peil, Barbara
Kabisch, Maria
Fischer, Christine
Hamann, Ute
Bermejo, Justo Lorenzo - Abstract:
- <abstract abstract-type="main"> <title>Abstract</title> <p>The addition of sequence data from own‐study individuals to genotypes from external data repositories, for example, the HapMap, has been shown to improve the accuracy of imputed genotypes. Early approaches for reference panel selection favored individuals who best reflect recombination patterns in the study population. By contrast, a maximization of genetic diversity in the reference panel has been recently proposed. We investigate here a novel strategy to select individuals for sequencing that relies on the characterization of the ancestral kernel of the study population. The simulated study scenarios consisted of several combinations of subpopulations from HapMap. HapMap individuals who did not belong to the study population constituted an external reference panel which was complemented with the sequences of study individuals selected according to different strategies. In addition to a random choice, individuals with the largest statistical depth according to the first genetic principal components were selected. In all simulated scenarios the integration of sequences from own‐study individuals increased imputation accuracy. The selection of individuals based on the statistical depth resulted in the highest imputation accuracy for European and Asian study scenarios, whereas random selection performed best for an African‐study scenario. Present findings indicate that there is no universal 'best strategy' to select<abstract abstract-type="main"> <title>Abstract</title> <p>The addition of sequence data from own‐study individuals to genotypes from external data repositories, for example, the HapMap, has been shown to improve the accuracy of imputed genotypes. Early approaches for reference panel selection favored individuals who best reflect recombination patterns in the study population. By contrast, a maximization of genetic diversity in the reference panel has been recently proposed. We investigate here a novel strategy to select individuals for sequencing that relies on the characterization of the ancestral kernel of the study population. The simulated study scenarios consisted of several combinations of subpopulations from HapMap. HapMap individuals who did not belong to the study population constituted an external reference panel which was complemented with the sequences of study individuals selected according to different strategies. In addition to a random choice, individuals with the largest statistical depth according to the first genetic principal components were selected. In all simulated scenarios the integration of sequences from own‐study individuals increased imputation accuracy. The selection of individuals based on the statistical depth resulted in the highest imputation accuracy for European and Asian study scenarios, whereas random selection performed best for an African‐study scenario. Present findings indicate that there is no universal 'best strategy' to select individuals for sequencing. We propose to use the methodology described in the manuscript to assess the advantage of focusing on the ancestral kernel under own study characteristics (study size, genetic diversity, availability and properties of external reference panels, frequency of imputed variants…).</p> </abstract> … (more)
- Is Part Of:
- Genetic epidemiology. Volume 39:Issue 2(2015)
- Journal:
- Genetic epidemiology
- Issue:
- Volume 39:Issue 2(2015)
- Issue Display:
- Volume 39, Issue 2 (2015)
- Year:
- 2015
- Volume:
- 39
- Issue:
- 2
- Issue Sort Value:
- 2015-0039-0002-0000
- Page Start:
- 114
- Page End:
- 121
- Publication Date:
- 2014-12-23
- Subjects:
- Genetic epidemiology -- Periodicals
Heredity -- Periodicals
Medical geography -- Periodicals
614 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-2272 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/gepi.21873 ↗
- Languages:
- English
- ISSNs:
- 0741-0395
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4111.848000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3288.xml