ScaffoldSeq: Software for characterization of directed evolution populations. Issue 7 (16th April 2016)
- Record Type:
- Journal Article
- Title:
- ScaffoldSeq: Software for characterization of directed evolution populations. Issue 7 (16th April 2016)
- Main Title:
- ScaffoldSeq: Software for characterization of directed evolution populations
- Authors:
- Woldring, Daniel R.
Holec, Patrick V.
Hackel, Benjamin J. - Abstract:
- ABSTRACT: ScaffoldSeq is software designed for the numerous applications—including directed evolution analysis—in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high‐throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site‐specific manner as well as identify epistatic pairwise interactions. TheABSTRACT: ScaffoldSeq is software designed for the numerous applications—including directed evolution analysis—in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high‐throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site‐specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available athttp://research.cems.umn.edu/hackel . Proteins 2016; 84:869–874. © 2016 Wiley Periodicals, Inc. … (more)
- Is Part Of:
- Proteins. Volume 84:Issue 7(2016)
- Journal:
- Proteins
- Issue:
- Volume 84:Issue 7(2016)
- Issue Display:
- Volume 84, Issue 7 (2016)
- Year:
- 2016
- Volume:
- 84
- Issue:
- 7
- Issue Sort Value:
- 2016-0084-0007-0000
- Page Start:
- 869
- Page End:
- 874
- Publication Date:
- 2016-04-16
- Subjects:
- bioinformatics -- high‐throughput -- sequence analysis -- family clustering -- epistasis -- software
Proteins -- Periodicals
Proteins -- Periodicals
572.6 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/prot.25040 ↗
- Languages:
- English
- ISSNs:
- 0887-3585
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6936.164000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 185.xml