SNP discovery in nonmodel organisms: strand bias and base‐substitution errors reduce conversion rates. (23rd November 2014)
- Record Type:
- Journal Article
- Title:
- SNP discovery in nonmodel organisms: strand bias and base‐substitution errors reduce conversion rates. (23rd November 2014)
- Main Title:
- SNP discovery in nonmodel organisms: strand bias and base‐substitution errors reduce conversion rates
- Authors:
- Gonçalves da Silva, Anders
Barendse, William
Kijas, James W.
Barris, Wes C.
McWilliam, Sean
Bunch, Rowan J.
McCullough, Russell
Harrison, Blair
Hoelzel, A. Rus
England, Phillip R. - Abstract:
- <abstract abstract-type="main" id="men12343-abs-0001"> <title>Abstract</title> <p>Single nucleotide polymorphisms (SNPs) have become the marker of choice for genetic studies in organisms of conservation, commercial or biological interest. Most SNP discovery projects in nonmodel organisms apply a strategy for identifying putative SNPs based on filtering rules that account for random sequencing errors. Here, we analyse data used to develop 4723 novel SNPs for the commercially important deep‐sea fish, orange roughy (<italic>Hoplostethus atlanticus</italic>), to assess the impact of not accounting for systematic sequencing errors when filtering identified polymorphisms when discovering SNPs. We used SAMtools to identify polymorphisms in a <sc>velvet</sc> assembly of genomic DNA sequence data from seven individuals. The resulting set of polymorphisms were filtered to minimize 'bycatch'—polymorphisms caused by sequencing or assembly error. An Illumina Infinium SNP chip was used to genotype a final set of 7714 polymorphisms across 1734 individuals. Five predictors were examined for their effect on the probability of obtaining an assayable SNP: depth of coverage, number of reads that support a variant, polymorphism type (e.g. A/C), strand‐bias and Illumina SNP probe design score. Our results indicate that filtering out systematic sequencing errors could substantially improve the efficiency of SNP discovery. We show that <italic>BLASTX</italic> can be used as an efficient tool to<abstract abstract-type="main" id="men12343-abs-0001"> <title>Abstract</title> <p>Single nucleotide polymorphisms (SNPs) have become the marker of choice for genetic studies in organisms of conservation, commercial or biological interest. Most SNP discovery projects in nonmodel organisms apply a strategy for identifying putative SNPs based on filtering rules that account for random sequencing errors. Here, we analyse data used to develop 4723 novel SNPs for the commercially important deep‐sea fish, orange roughy (<italic>Hoplostethus atlanticus</italic>), to assess the impact of not accounting for systematic sequencing errors when filtering identified polymorphisms when discovering SNPs. We used SAMtools to identify polymorphisms in a <sc>velvet</sc> assembly of genomic DNA sequence data from seven individuals. The resulting set of polymorphisms were filtered to minimize 'bycatch'—polymorphisms caused by sequencing or assembly error. An Illumina Infinium SNP chip was used to genotype a final set of 7714 polymorphisms across 1734 individuals. Five predictors were examined for their effect on the probability of obtaining an assayable SNP: depth of coverage, number of reads that support a variant, polymorphism type (e.g. A/C), strand‐bias and Illumina SNP probe design score. Our results indicate that filtering out systematic sequencing errors could substantially improve the efficiency of SNP discovery. We show that <italic>BLASTX</italic> can be used as an efficient tool to identify single‐copy genomic regions in the absence of a reference genome. The results have implications for research aiming to identify assayable SNPs and build SNP genotyping assays for nonmodel organisms.</p> </abstract> … (more)
- Is Part Of:
- Molecular ecology resources. Volume 15:Number 4(2015:Jul.)
- Journal:
- Molecular ecology resources
- Issue:
- Volume 15:Number 4(2015:Jul.)
- Issue Display:
- Volume 15, Issue 4 (2015)
- Year:
- 2015
- Volume:
- 15
- Issue:
- 4
- Issue Sort Value:
- 2015-0015-0004-0000
- Page Start:
- 723
- Page End:
- 736
- Publication Date:
- 2014-11-23
- Subjects:
- Molecular ecology -- Periodicals
572.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1755-0998 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/1755-0998.12343 ↗
- Languages:
- English
- ISSNs:
- 1755-098X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.817368
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 3657.xml