AIM-SNPtag: A computationally efficient approach for developing ancestry-informative SNP panels. (January 2019)
- Record Type:
- Journal Article
- Title:
- AIM-SNPtag: A computationally efficient approach for developing ancestry-informative SNP panels. (January 2019)
- Main Title:
- AIM-SNPtag: A computationally efficient approach for developing ancestry-informative SNP panels
- Authors:
- Zhao, Shilei
Shi, Cheng-Min
Ma, Liang
Liu, Qi
Liu, Yongming
Wu, Fuquan
Chi, Lianjiang
Chen, Hua - Abstract:
- Highlights: The computational method is highly efficient in extracting the most informative AIM panel including a small number of SNPs to achieve a certain degree of accuracy (e.g. 95% or 99%). The method is capable of exploring the genome-wide SNP data for a large number of populations. A 36-SNP panel was generated from the 1000 Genomes Project data using the new method, which achieved membership predictive accuracy of ∼99% for major human groups. Abstract: Inferring an individual's ancestry or group membership using a small set of highly informative genetic markers is very useful in forensic and medical genetics. However, given the huge amount of SNP data available from a diverse of populations, it is challenging to develop informative panels by exhaustively searching for all possible SNP combinations. In this study, we formulate it as an algorithm problem of selecting an optimal set of SNPs that maximizes the inference accuracy while minimizes the set size. Built on this conception, we develop a computational approach that is capable of constructing ancestry informative panels from multi-population genome-wide SNP data efficiently. We evaluated the performance of the method by comparing the panel size and membership inference accuracy of the constructed SNP panels to panels selected through empirical procedures in previous studies. For the membership inference of population groups including Asian, European, African, East Asian and Southeast Asian, a 36-SNP panel developedHighlights: The computational method is highly efficient in extracting the most informative AIM panel including a small number of SNPs to achieve a certain degree of accuracy (e.g. 95% or 99%). The method is capable of exploring the genome-wide SNP data for a large number of populations. A 36-SNP panel was generated from the 1000 Genomes Project data using the new method, which achieved membership predictive accuracy of ∼99% for major human groups. Abstract: Inferring an individual's ancestry or group membership using a small set of highly informative genetic markers is very useful in forensic and medical genetics. However, given the huge amount of SNP data available from a diverse of populations, it is challenging to develop informative panels by exhaustively searching for all possible SNP combinations. In this study, we formulate it as an algorithm problem of selecting an optimal set of SNPs that maximizes the inference accuracy while minimizes the set size. Built on this conception, we develop a computational approach that is capable of constructing ancestry informative panels from multi-population genome-wide SNP data efficiently. We evaluated the performance of the method by comparing the panel size and membership inference accuracy of the constructed SNP panels to panels selected through empirical procedures in previous studies. For the membership inference of population groups including Asian, European, African, East Asian and Southeast Asian, a 36-SNP panel developed by our approach has an overall accuracy of 99.07%, and a 21-SNP subset of the panel has an overall accuracy of 95.36%. In comparison, an existing panel requires 74 SNPs to achieve an accuracy of 94.14% on the same set of population groups. We further apply the method to four subpopulations within Europe (Finnish, British, Spanish and Italian); a 175-SNP panel can discriminate individuals of those European subpopulations with an accuracy of 99.36%, of which a 68-SNP subset can achieve an accuracy of 95.07%. We expect our method to be a useful tool for constructing ancestry informative markers in forensic genetics. … (more)
- Is Part Of:
- Forensic science international. Volume 38(2019)
- Journal:
- Forensic science international
- Issue:
- Volume 38(2019)
- Issue Display:
- Volume 38, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 38
- Issue:
- 2019
- Issue Sort Value:
- 2019-0038-2019-0000
- Page Start:
- 245
- Page End:
- 253
- Publication Date:
- 2019-01
- Subjects:
- Ancestry inference -- Forensics -- Genome-wide SNPs -- Membership -- SNP panel
Forensic genetics -- Periodicals
Génétique légale -- Périodiques
Forensic genetics
Electronic journals
Periodicals
614.1 - Journal URLs:
- http://www.clinicalkey.com.au/dura/browse/journalIssue/18724973 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/18724973 ↗
http://www.sciencedirect.com/science/journal/18724973 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.fsigen.2018.10.015 ↗
- Languages:
- English
- ISSNs:
- 1872-4973
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3987.764050
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8857.xml