BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection. Issue 19 (3rd September 2021)
- Record Type:
- Journal Article
- Title:
- BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection. Issue 19 (3rd September 2021)
- Main Title:
- BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
- Authors:
- Forni, Giobbe
Ruggieri, Angelo Alberto
Piccinini, Giovanni
Luchetti, Andrea - Abstract:
- Abstract: Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (d N /d S, also known as ω ). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforwardAbstract: Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (d N /d S, also known as ω ). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforward and reproducible manner. The workflow—along with all relevant documentation—is available at github.com/for‐giobbe/BASE. Abstract : Comparative genomics analyses for selection are often restricted to the subset of genes which is shared by all the species considered (i.e., ubiquitous genes) while those which are not found across some of them (i.e., nonubiquitous genes) are often overlooked, due to the lack of automated approaches to include them. Yet, disregarding such a large portion of genes may potentially conceal important evolutionary processes. For this reason, we developed a novel workflow—named BASE—which allows the integration of nonubiquitous genes in a straightforward and reproducible manner. … (more)
- Is Part Of:
- Ecology and evolution. Volume 11:Issue 19(2021)
- Journal:
- Ecology and evolution
- Issue:
- Volume 11:Issue 19(2021)
- Issue Display:
- Volume 11, Issue 19 (2021)
- Year:
- 2021
- Volume:
- 11
- Issue:
- 19
- Issue Sort Value:
- 2021-0011-0019-0000
- Page Start:
- 13029
- Page End:
- 13035
- Publication Date:
- 2021-09-03
- Subjects:
- CodeML -- dN/dS -- molecular evolution -- omega -- selective pressures
Ecology -- Periodicals
Evolution -- Periodicals
577.05 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2045-7758 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/ece3.7959 ↗
- Languages:
- English
- ISSNs:
- 2045-7758
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19373.xml