Applications of random forest feature selection for fine‐scale genetic population assignment. (14th September 2017)
- Record Type:
- Journal Article
- Title:
- Applications of random forest feature selection for fine‐scale genetic population assignment. (14th September 2017)
- Main Title:
- Applications of random forest feature selection for fine‐scale genetic population assignment
- Authors:
- Sylvester, Emma V. A.
Bentzen, Paul
Bradbury, Ian R.
Clément, Marie
Pearce, Jon
Horne, John
Beiko, Robert G. - Abstract:
- Abstract: Genetic population assignment used to inform wildlife management and conservation efforts requires panels of highly informative genetic markers and sensitive assignment tests. We explored the utility of machine‐learning algorithms (random forest, regularized random forest and guided regularized random forest) compared with F ST ranking for selection of single nucleotide polymorphisms (SNP) for fine‐scale population assignment. We applied these methods to an unpublished SNP data set for Atlantic salmon ( Salmo salar ) and a published SNP data set for Alaskan Chinook salmon ( Oncorhynchus tshawytscha ). In each species, we identified the minimum panel size required to obtain a self‐assignment accuracy of at least 90% using each method to create panels of 50–700 markers Panels of SNPs identified using random forest‐based methods performed up to 7.8 and 11.2 percentage points better than F ST ‐selected panels of similar size for the Atlantic salmon and Chinook salmon data, respectively. Self‐assignment accuracy ≥90% was obtained with panels of 670 and 384 SNPs for each data set, respectively, a level of accuracy never reached for these species using F ST ‐selected panels. Our results demonstrate a role for machine‐learning approaches in marker selection across large genomic data sets to improve assignment for management and conservation of exploited populations.
- Is Part Of:
- Evolutionary applications. Volume 11:Number 2(2018)
- Journal:
- Evolutionary applications
- Issue:
- Volume 11:Number 2(2018)
- Issue Display:
- Volume 11, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 11
- Issue:
- 2
- Issue Sort Value:
- 2018-0011-0002-0000
- Page Start:
- 153
- Page End:
- 165
- Publication Date:
- 2017-09-14
- Subjects:
- conservation genetics -- fisheries management -- individual assignment -- random forest -- SNP selection
Evolution (Biology) -- Periodicals
Genetics -- Periodicals
Natural selection -- Periodicals
Ecology -- Periodicals
576.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1752-4571 ↗
http://www.blackwellpublishing.com/journal.asp?ref=1752-4571&site=1 ↗
http://www3.interscience.wiley.com/journal/119423602/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/eva.12524 ↗
- Languages:
- English
- ISSNs:
- 1752-4571
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3834.390500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5712.xml