Interpretation and approximation tools for big, dense Markov chain transition matrices in population genetics. Issue 1 (December 2015)
- Record Type:
- Journal Article
- Title:
- Interpretation and approximation tools for big, dense Markov chain transition matrices in population genetics. Issue 1 (December 2015)
- Main Title:
- Interpretation and approximation tools for big, dense Markov chain transition matrices in population genetics
- Authors:
- Reichel, Katja
Bahier, Valentin
Midoux, Cédric
Parisey, Nicolas
Masson, Jean-Pierre
Stoeckel, Solenn - Abstract:
- Abstract Background Markov chains are a common framework for individual-based state and time discrete models in evolution. Though they played an important role in the development of basic population genetic theory, the analysis of more complex evolutionary scenarios typically involves approximation with other types of models. As the number of states increases, the big, dense transition matrices involved become increasingly unwieldy. However, advances in computational technology continue to reduce the challenges of "big data", thus giving new potential to state-rich Markov chains in theoretical population genetics. Results Using a population genetic model based on genotype frequencies as an example, we propose a set of methods to assist in the computation and interpretation of big, dense Markov chain transition matrices. With the help of network analysis, we demonstrate how they can be transformed into clear and easily interpretable graphs, providing a new perspective even on the classic case of a randomly mating, finite population with mutation. Moreover, we describe an algorithm to save computer memory by substituting the original matrix with a sparse approximate while preserving its mathematically important properties, including a closely corresponding dominant (normalized) eigenvector. A global sensitivity analysis of the approximation results in our example shows that size reduction of more than 90 % is possible without significantly affecting the basic model results.Abstract Background Markov chains are a common framework for individual-based state and time discrete models in evolution. Though they played an important role in the development of basic population genetic theory, the analysis of more complex evolutionary scenarios typically involves approximation with other types of models. As the number of states increases, the big, dense transition matrices involved become increasingly unwieldy. However, advances in computational technology continue to reduce the challenges of "big data", thus giving new potential to state-rich Markov chains in theoretical population genetics. Results Using a population genetic model based on genotype frequencies as an example, we propose a set of methods to assist in the computation and interpretation of big, dense Markov chain transition matrices. With the help of network analysis, we demonstrate how they can be transformed into clear and easily interpretable graphs, providing a new perspective even on the classic case of a randomly mating, finite population with mutation. Moreover, we describe an algorithm to save computer memory by substituting the original matrix with a sparse approximate while preserving its mathematically important properties, including a closely corresponding dominant (normalized) eigenvector. A global sensitivity analysis of the approximation results in our example shows that size reduction of more than 90 % is possible without significantly affecting the basic model results. Sample implementations of our methods are collected in the Python modulemamoth . Conclusion Our methods help to make stochastic population genetic models involving big, dense transition matrices computationally feasible. Our visualization techniques provide new ways to explore such models and concisely present the results. Thus, our methods will contribute to establish state-rich Markov chains as a valuable supplement to the diversity of population genetic models currently employed, providing interesting new details about evolution e.g. under non-standard reproductive systems such as partial clonality. … (more)
- Is Part Of:
- Algorithms for molecular biology. Volume 10:Issue 1(2015)
- Journal:
- Algorithms for molecular biology
- Issue:
- Volume 10:Issue 1(2015)
- Issue Display:
- Volume 10, Issue 1 (2015)
- Year:
- 2015
- Volume:
- 10
- Issue:
- 1
- Issue Sort Value:
- 2015-0010-0001-0000
- Page Start:
- 1
- Page End:
- 14
- Publication Date:
- 2015-12
- Subjects:
- Discrete stochastic model -- Sparse approximation -- Eigenvector -- Network analysis -- Population genetics -- Compositional data -- de Finetti diagram
Molecular biology -- Mathematical models -- Periodicals
Algorithms -- Periodicals
Bioinformatics -- Periodicals
572.8015118 - Journal URLs:
- http://pubmedcentral.com/tocrender.fcgi?journal=403&action=archive ↗
http://www.almob.org/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s13015-015-0061-5 ↗
- Languages:
- English
- ISSNs:
- 1748-7188
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9847.xml