Consistency of SVDQuartets and Maximum Likelihood for Coalescent-Based Species Tree Estimation. (16th May 2020)
- Record Type:
- Journal Article
- Title:
- Consistency of SVDQuartets and Maximum Likelihood for Coalescent-Based Species Tree Estimation. (16th May 2020)
- Main Title:
- Consistency of SVDQuartets and Maximum Likelihood for Coalescent-Based Species Tree Estimation
- Authors:
- Wascher, Matthew
Kubatko, Laura - Editors:
- Susko, Edward
- Abstract:
- Abstract: Numerous methods for inferring species-level phylogenies under the coalescent model have been proposed within the last 20 years, and debates continue about the relative strengths and weaknesses of these methods. One desirable property of a phylogenetic estimator is that of statistical consistency, which means intuitively that as more data are collected, the probability that the estimated tree has the same topology as the true tree goes to 1. To date, consistency results for species tree inference under the multispecies coalescent (MSC) have been derived only for summary statistics methods, such as ASTRAL and MP-EST. These methods have been found to be consistent given true gene trees but may be inconsistent when gene trees are estimated from data for loci of finite length. Here, we consider the question of statistical consistency for four taxa for SVDQuartets for general data types, as well as for the maximum likelihood (ML) method in the case in which the data are a collection of sites generated under the MSC model such that the sites are conditionally independent given the species tree (we call these data coalescent independent sites [CIS] data). We show that SVDQuartets is statistically consistent for all data types (i.e., for both CIS data and for multilocus data), and we derive its rate of convergence. We additionally show that ML is consistent for CIS data under the JC69 model and discuss why a proof for the more general multilocus case is difficult. Finally,Abstract: Numerous methods for inferring species-level phylogenies under the coalescent model have been proposed within the last 20 years, and debates continue about the relative strengths and weaknesses of these methods. One desirable property of a phylogenetic estimator is that of statistical consistency, which means intuitively that as more data are collected, the probability that the estimated tree has the same topology as the true tree goes to 1. To date, consistency results for species tree inference under the multispecies coalescent (MSC) have been derived only for summary statistics methods, such as ASTRAL and MP-EST. These methods have been found to be consistent given true gene trees but may be inconsistent when gene trees are estimated from data for loci of finite length. Here, we consider the question of statistical consistency for four taxa for SVDQuartets for general data types, as well as for the maximum likelihood (ML) method in the case in which the data are a collection of sites generated under the MSC model such that the sites are conditionally independent given the species tree (we call these data coalescent independent sites [CIS] data). We show that SVDQuartets is statistically consistent for all data types (i.e., for both CIS data and for multilocus data), and we derive its rate of convergence. We additionally show that ML is consistent for CIS data under the JC69 model and discuss why a proof for the more general multilocus case is difficult. Finally, we compare the performance of ML and SDVQuartets using simulation for both data types. [Consistency; gene tree; maximum likelihood; multilocus data; hylogenetic inference; species tree; SVDQuartets.] … (more)
- Is Part Of:
- Systematic biology. Volume 70:Number 1(2021)
- Journal:
- Systematic biology
- Issue:
- Volume 70:Number 1(2021)
- Issue Display:
- Volume 70, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 70
- Issue:
- 1
- Issue Sort Value:
- 2021-0070-0001-0000
- Page Start:
- 33
- Page End:
- 48
- Publication Date:
- 2020-05-16
- Subjects:
- Biology -- Classification -- Periodicals
Biology -- Periodicals
Biologie -- Classification -- Périodiques
Biologie -- Périodiques
578.012 - Journal URLs:
- http://ukcatalogue.oup.com/ ↗
- DOI:
- 10.1093/sysbio/syaa039 ↗
- Languages:
- English
- ISSNs:
- 1063-5157
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8589.180700
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15223.xml