Using all Gene Families Vastly Expands Data Available for Phylogenomic Inference. (1st June 2022)
- Record Type:
- Journal Article
- Title:
- Using all Gene Families Vastly Expands Data Available for Phylogenomic Inference. (1st June 2022)
- Main Title:
- Using all Gene Families Vastly Expands Data Available for Phylogenomic Inference
- Authors:
- Smith, Megan L.
Vanderpool, Dan
Hahn, Matthew W. - Editors:
- Townsend, Jeffrey
- Abstract:
- Abstract: Traditionally, single-copy orthologs have been the gold standard in phylogenomics. Most phylogenomic studies identify putative single-copy orthologs using clustering approaches and retain families with a single sequence per species. This limits the amount of data available by excluding larger families. Recent advances have suggested several ways to include data from larger families. For instance, tree-based decomposition methods facilitate the extraction of orthologs from large families. Additionally, several methods for species tree inference are robust to the inclusion of paralogs and could use all of the data from larger families. Here, we explore the effects of using all families for phylogenetic inference by examining relationships among 26 primate species in detail and by analyzing five additional data sets. We compare single-copy families, orthologs extracted using tree-based decomposition approaches, and all families with all data. We explore several species tree inference methods, finding that identical trees are returned across nearly all subsets of the data and methods for primates. The relationships among Platyrrhini remain contentious; however, the species tree inference method matters more than the subset of data used. Using data from larger gene families drastically increases the number of genes available and leads to consistent estimates of branch lengths, nodal certainty and concordance, and inferences of introgression in primates. For the otherAbstract: Traditionally, single-copy orthologs have been the gold standard in phylogenomics. Most phylogenomic studies identify putative single-copy orthologs using clustering approaches and retain families with a single sequence per species. This limits the amount of data available by excluding larger families. Recent advances have suggested several ways to include data from larger families. For instance, tree-based decomposition methods facilitate the extraction of orthologs from large families. Additionally, several methods for species tree inference are robust to the inclusion of paralogs and could use all of the data from larger families. Here, we explore the effects of using all families for phylogenetic inference by examining relationships among 26 primate species in detail and by analyzing five additional data sets. We compare single-copy families, orthologs extracted using tree-based decomposition approaches, and all families with all data. We explore several species tree inference methods, finding that identical trees are returned across nearly all subsets of the data and methods for primates. The relationships among Platyrrhini remain contentious; however, the species tree inference method matters more than the subset of data used. Using data from larger gene families drastically increases the number of genes available and leads to consistent estimates of branch lengths, nodal certainty and concordance, and inferences of introgression in primates. For the other data sets, topological inferences are consistent whether single-copy families or orthologs extracted using decomposition approaches are analyzed. Using larger gene families is a promising approach to include more data in phylogenomics without sacrificing accuracy, at least when high-quality genomes are available. … (more)
- Is Part Of:
- Molecular biology and evolution. Volume 39:Number 6(2022)
- Journal:
- Molecular biology and evolution
- Issue:
- Volume 39:Number 6(2022)
- Issue Display:
- Volume 39, Issue 6 (2022)
- Year:
- 2022
- Volume:
- 39
- Issue:
- 6
- Issue Sort Value:
- 2022-0039-0006-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06-01
- Subjects:
- phylogenetics -- orthologs -- paralogs -- concatenation -- coalescence
Molecular biology -- Periodicals
Molecular evolution -- Periodicals
Evolution, Molecular -- Periodicals
Molecular Biology -- Periodicals
572.8 - Journal URLs:
- http://mbe.oxfordjournals.org/ ↗
http://www.molbiolevol.org/ ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0737-7038;screen=info;ECOIP ↗ - DOI:
- 10.1093/molbev/msac112 ↗
- Languages:
- English
- ISSNs:
- 0737-4038
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.782000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21818.xml