Optimization and performance testing of a sequence processing pipeline applied to detection of nonindigenous species. (20th February 2018)
- Record Type:
- Journal Article
- Title:
- Optimization and performance testing of a sequence processing pipeline applied to detection of nonindigenous species. (20th February 2018)
- Main Title:
- Optimization and performance testing of a sequence processing pipeline applied to detection of nonindigenous species
- Authors:
- Scott, Ryan
Zhan, Aibin
Brown, Emily A.
Chain, Frédéric J. J.
Cristescu, Melania E.
Gras, Robin
MacIsaac, Hugh J. - Abstract:
- Abstract: Genetic taxonomic assignment can be more sensitive than morphological taxonomic assignment, particularly for small, cryptic or rare species. Sequence processing is essential to taxonomic assignment, but can also produce errors because optimal parameters are not known a priori. Here, we explored how sequence processing parameters influence taxonomic assignment of 18S sequences from bulk zooplankton samples produced by 454 pyrosequencing. We optimized a sequence processing pipeline for two common research goals, estimation of species richness and early detection of aquatic invasive species (AIS), and then tested most optimal models' performances through simulations. We tested 1, 050 parameter sets on 18S sequences from 20 AIS to determine optimal parameters for each research goal. We tested optimized pipelines' performances (detectability and sensitivity) by computationally inoculating sequences of 20 AIS into ten bulk zooplankton samples from ports across Canada. We found that optimal parameter selection generally depends on the research goal. However, regardless of research goal, we found that metazoan 18S sequences produced by 454 pyrosequencing should be trimmed to 375–400 bp and sequence quality filtering should be relaxed (1.5 ≤ maximum expected error ≤ 3.0, Phred score = 10). Clustering and denoising were only viable for estimating species richness, because these processing steps made some species undetectable at low sequence abundances which would not beAbstract: Genetic taxonomic assignment can be more sensitive than morphological taxonomic assignment, particularly for small, cryptic or rare species. Sequence processing is essential to taxonomic assignment, but can also produce errors because optimal parameters are not known a priori. Here, we explored how sequence processing parameters influence taxonomic assignment of 18S sequences from bulk zooplankton samples produced by 454 pyrosequencing. We optimized a sequence processing pipeline for two common research goals, estimation of species richness and early detection of aquatic invasive species (AIS), and then tested most optimal models' performances through simulations. We tested 1, 050 parameter sets on 18S sequences from 20 AIS to determine optimal parameters for each research goal. We tested optimized pipelines' performances (detectability and sensitivity) by computationally inoculating sequences of 20 AIS into ten bulk zooplankton samples from ports across Canada. We found that optimal parameter selection generally depends on the research goal. However, regardless of research goal, we found that metazoan 18S sequences produced by 454 pyrosequencing should be trimmed to 375–400 bp and sequence quality filtering should be relaxed (1.5 ≤ maximum expected error ≤ 3.0, Phred score = 10). Clustering and denoising were only viable for estimating species richness, because these processing steps made some species undetectable at low sequence abundances which would not be useful for early detection of AIS. With parameter sets optimized for early detection of AIS, 90% of AIS were detected with fewer than 11 target sequences, regardless of whether clustering or denoising was used. Despite developments in next‐generation sequencing, sequence processing remains an important issue owing to difficulties in balancing false‐positive and false‐negative errors in metabarcoding data. … (more)
- Is Part Of:
- Evolutionary applications. Volume 11:Number 6(2018)
- Journal:
- Evolutionary applications
- Issue:
- Volume 11:Number 6(2018)
- Issue Display:
- Volume 11, Issue 6 (2018)
- Year:
- 2018
- Volume:
- 11
- Issue:
- 6
- Issue Sort Value:
- 2018-0011-0006-0000
- Page Start:
- 891
- Page End:
- 905
- Publication Date:
- 2018-02-20
- Subjects:
- aquatic invasive species -- biomonitoring -- clustering -- high‐throughput sequencing -- metabarcoding -- sequence processing
Evolution (Biology) -- Periodicals
Genetics -- Periodicals
Natural selection -- Periodicals
Ecology -- Periodicals
576.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1752-4571 ↗
http://www.blackwellpublishing.com/journal.asp?ref=1752-4571&site=1 ↗
http://www3.interscience.wiley.com/journal/119423602/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/eva.12604 ↗
- Languages:
- English
- ISSNs:
- 1752-4571
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3834.390500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6884.xml