Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets. (25th June 2018)
- Record Type:
- Journal Article
- Title:
- Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets. (25th June 2018)
- Main Title:
- Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets
- Authors:
- Dornburg, Alex
Su, Zhuo
Townsend, Jeffrey P - Editors:
- Mueller, Rachel
- Abstract:
- Abstract: With the rise of genome-scale data sets, there has been a call for increased data scrutiny and careful selection of loci that are appropriate to use in an attempt to resolve a phylogenetic problem. Such loci should maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve at an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Herein, we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters—not genes—are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogeneticAbstract: With the rise of genome-scale data sets, there has been a call for increased data scrutiny and careful selection of loci that are appropriate to use in an attempt to resolve a phylogenetic problem. Such loci should maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve at an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Herein, we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters—not genes—are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design. [Divergence time; GC bias; homoplasy; incongruence; information content; internode length; optimal rates; phylogenetic informativeness; phylogenetic theory; phylogenetic utility; phylogenomics; signal and noise; subtending branch length; state space; taxon and character sampling.] … (more)
- Is Part Of:
- Systematic biology. Volume 68:Number 1(2019)
- Journal:
- Systematic biology
- Issue:
- Volume 68:Number 1(2019)
- Issue Display:
- Volume 68, Issue 1 (2019)
- Year:
- 2019
- Volume:
- 68
- Issue:
- 1
- Issue Sort Value:
- 2019-0068-0001-0000
- Page Start:
- 145
- Page End:
- 156
- Publication Date:
- 2018-06-25
- Subjects:
- Biology -- Classification -- Periodicals
Biology -- Periodicals
Biologie -- Classification -- Périodiques
Biologie -- Périodiques
578.012 - Journal URLs:
- http://ukcatalogue.oup.com/ ↗
- DOI:
- 10.1093/sysbio/syy047 ↗
- Languages:
- English
- ISSNs:
- 1063-5157
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8589.180700
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11803.xml