APPLES: Scalable Distance-Based Phylogenetic Placement with or without Alignments. (23rd September 2019)
- Record Type:
- Journal Article
- Title:
- APPLES: Scalable Distance-Based Phylogenetic Placement with or without Alignments. (23rd September 2019)
- Main Title:
- APPLES: Scalable Distance-Based Phylogenetic Placement with or without Alignments
- Authors:
- Balaban, Metin
Sarmashghi, Shahab
Mirarab, Siavash - Editors:
- Posada, David
- Abstract:
- Abstract: Placing a new species on an existing phylogeny has increasing relevance to several applications. Placement can be used to update phylogenies in a scalable fashion and can help identify unknown query samples using (meta-)barcoding, skimming, or metagenomic data. Maximum likelihood (ML) methods of phylogenetic placement exist, but these methods are not scalable to reference trees with many thousands of leaves, limiting their ability to enjoy benefits of dense taxon sampling in modern reference libraries. They also rely on assembled sequences for the reference set and aligned sequences for the query. Thus, ML methods cannot analyze data sets where the reference consists of unassembled reads, a scenario relevant to emerging applications of genome skimming for sample identification. We introduce APPLES, a distance-based method for phylogenetic placement. Compared to ML, APPLES is an order of magnitude faster and more memory efficient, and unlike ML, it is able to place on large backbone trees (tested for up to 200, 000 leaves). We show that using dense references improves accuracy substantially so that APPLES on dense trees is more accurate than ML on sparser trees, where it can run. Finally, APPLES can accurately identify samples without assembled reference or aligned queries using kmer-based distances, a scenario that ML cannot handle. APPLES is available publically at github.com/balabanmetin/apples .
- Is Part Of:
- Systematic biology. Volume 69:Number 3(2020)
- Journal:
- Systematic biology
- Issue:
- Volume 69:Number 3(2020)
- Issue Display:
- Volume 69, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 69
- Issue:
- 3
- Issue Sort Value:
- 2020-0069-0003-0000
- Page Start:
- 566
- Page End:
- 578
- Publication Date:
- 2019-09-23
- Subjects:
- Distance-based methods -- genome skimming -- phylogenetic placement
Biology -- Classification -- Periodicals
Biology -- Periodicals
Biologie -- Classification -- Périodiques
Biologie -- Périodiques
578.012 - Journal URLs:
- http://ukcatalogue.oup.com/ ↗
- DOI:
- 10.1093/sysbio/syz063 ↗
- Languages:
- English
- ISSNs:
- 1063-5157
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8589.180700
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15045.xml