An Integrated Perspective on Phylogenetic Workflows. (February 2016)
- Record Type:
- Journal Article
- Title:
- An Integrated Perspective on Phylogenetic Workflows. (February 2016)
- Main Title:
- An Integrated Perspective on Phylogenetic Workflows
- Authors:
- Guang, August
Zapata, Felipe
Howison, Mark
Lawrence, Charles E.
Dunn, Casey W. - Abstract:
- Abstract : Molecular phylogenetics is the study of evolutionary relationships between biological sequences, often to infer the evolutionary relationships of organisms. These studies require many analysis components, including sequence assembly, identification of homologous sequences, gene tree inference, and species tree inference. At present, each component is usually treated as a single step in a linear analysis, where the output of each component is passed as input to the next as a point estimate. Here we outline a generative model that helps clarify assumptions that are implicit to phylogenetic workflows, focusing on the assumption of low relative entropy. This perspective unifies currently disparate advances, and will help investigators evaluate which steps would benefit the most from additional computation and future methods development. Trends: Current phylogenetic analyses are implemented as multistep, linear workflows where intermediate analysis steps generate and pass on point estimates of unobserved variables. This linear structure and minimal information communication strategy embody three implicit assumptions: (i) the order of the analysis steps is biologically justified, (ii) a Markovian dependency structure, and (iii) low relative entropy of results of each analysis step. There is evidence that these assumptions, in particular low relative entropy, are frequently violated in empirical studies with potential detrimental effects in phylogenetic analyses. AAbstract : Molecular phylogenetics is the study of evolutionary relationships between biological sequences, often to infer the evolutionary relationships of organisms. These studies require many analysis components, including sequence assembly, identification of homologous sequences, gene tree inference, and species tree inference. At present, each component is usually treated as a single step in a linear analysis, where the output of each component is passed as input to the next as a point estimate. Here we outline a generative model that helps clarify assumptions that are implicit to phylogenetic workflows, focusing on the assumption of low relative entropy. This perspective unifies currently disparate advances, and will help investigators evaluate which steps would benefit the most from additional computation and future methods development. Trends: Current phylogenetic analyses are implemented as multistep, linear workflows where intermediate analysis steps generate and pass on point estimates of unobserved variables. This linear structure and minimal information communication strategy embody three implicit assumptions: (i) the order of the analysis steps is biologically justified, (ii) a Markovian dependency structure, and (iii) low relative entropy of results of each analysis step. There is evidence that these assumptions, in particular low relative entropy, are frequently violated in empirical studies with potential detrimental effects in phylogenetic analyses. A generative model and probabilistic framework provide a unified perspective to assess the costs and benefits of relaxing these assumptions, help identify what methods and tools are missing, and provide a context for evaluating priorities for future development. … (more)
- Is Part Of:
- Trends in ecology & evolution. Volume 31:Number 2(2016)
- Journal:
- Trends in ecology & evolution
- Issue:
- Volume 31:Number 2(2016)
- Issue Display:
- Volume 31, Issue 2 (2016)
- Year:
- 2016
- Volume:
- 31
- Issue:
- 2
- Issue Sort Value:
- 2016-0031-0002-0000
- Page Start:
- 116
- Page End:
- 126
- Publication Date:
- 2016-02
- Subjects:
- Ecology -- Periodicals
Evolution (Biology) -- Periodicals
576.8 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01695347 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.tree.2015.12.007 ↗
- Languages:
- English
- ISSNs:
- 0169-5347
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 9049.569000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8785.xml