A survey of algorithms for transforming molecular dynamics data into metadata for in situ analytics based on machine learning methods. (20th January 2020)
- Record Type:
- Journal Article
- Title:
- A survey of algorithms for transforming molecular dynamics data into metadata for in situ analytics based on machine learning methods. (20th January 2020)
- Main Title:
- A survey of algorithms for transforming molecular dynamics data into metadata for in situ analytics based on machine learning methods
- Authors:
- Taufer, Michela
Estrada, Trilce
Johnston, Travis - Abstract:
- Abstract : This paper presents the survey of three algorithms to transform atomic-level molecular snapshots from molecular dynamics (MD) simulations into metadata representations that are suitable for in situ analytics based on machine learning methods. MD simulations studying the classical time evolution of a molecular system at atomic resolution are widely recognized in the fields of chemistry, material sciences, molecular biology and drug design; these simulations are one of the most common simulations on supercomputers. Next-generation supercomputers will have a dramatically higher performance than current systems, generating more data that needs to be analysed (e.g. in terms of number and length of MD trajectories). In the future, the coordination of data generation and analysis can no longer rely on manual, centralized analysis traditionally performed after the simulation is completed or on current data representations that have been defined for traditional visualization tools. Powerful data preparation phases (i.e. phases in which original row data is transformed to concise and still meaningful representations) will need to proceed data analysis phases. Here, we discuss three algorithms for transforming traditionally used molecular representations into concise and meaningful metadata representations. The transformations can be performed locally. The new metadata can be fed into machine learning methods for runtime in situ analysis of larger MD trajectories supportedAbstract : This paper presents the survey of three algorithms to transform atomic-level molecular snapshots from molecular dynamics (MD) simulations into metadata representations that are suitable for in situ analytics based on machine learning methods. MD simulations studying the classical time evolution of a molecular system at atomic resolution are widely recognized in the fields of chemistry, material sciences, molecular biology and drug design; these simulations are one of the most common simulations on supercomputers. Next-generation supercomputers will have a dramatically higher performance than current systems, generating more data that needs to be analysed (e.g. in terms of number and length of MD trajectories). In the future, the coordination of data generation and analysis can no longer rely on manual, centralized analysis traditionally performed after the simulation is completed or on current data representations that have been defined for traditional visualization tools. Powerful data preparation phases (i.e. phases in which original row data is transformed to concise and still meaningful representations) will need to proceed data analysis phases. Here, we discuss three algorithms for transforming traditionally used molecular representations into concise and meaningful metadata representations. The transformations can be performed locally. The new metadata can be fed into machine learning methods for runtime in situ analysis of larger MD trajectories supported by high-performance computing. In this paper, we provide an overview of the three algorithms and their use for three different applications: protein–ligand docking in drug design; protein folding simulations; and protein engineering based on analytics of protein functions depending on proteins' three-dimensional structures. This article is part of a discussion meeting issue 'Numerical algorithms for high-performance computational science'. … (more)
- Is Part Of:
- Philosophical transactions. Volume 378:Number 2166(2020)
- Journal:
- Philosophical transactions
- Issue:
- Volume 378:Number 2166(2020)
- Issue Display:
- Volume 378, Issue 2166 (2020)
- Year:
- 2020
- Volume:
- 378
- Issue:
- 2166
- Issue Sort Value:
- 2020-0378-2166-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-01-20
- Subjects:
- protein–ligand docking -- protein folding -- protein engineering -- machine learning -- MapReduce
Physical sciences -- Periodicals
Engineering -- Periodicals
Mathematics -- Periodicals
500 - Journal URLs:
- https://royalsocietypublishing.org/loi/rsta ↗
- DOI:
- 10.1098/rsta.2019.0063 ↗
- Languages:
- English
- ISSNs:
- 1364-503X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library STI - ELD Digital store
- Ingest File:
- 12788.xml