Predicting the evolution of number of native contacts of a small protein by using deep learning approach. (April 2022)
- Record Type:
- Journal Article
- Title:
- Predicting the evolution of number of native contacts of a small protein by using deep learning approach. (April 2022)
- Main Title:
- Predicting the evolution of number of native contacts of a small protein by using deep learning approach
- Authors:
- Santra, Santanu
Jana, Madhurima - Abstract:
- Abstract: Native contacts (NCs) are one of the most vital parameters in order to define the resemblance of a protein conformation with its native state. Prediction of number of native contacts in a protein is useful in protein folding mechanism. In this work, we focused to predict the time series of the number of NCs of a small protein, insulin monomer by using three neural network based models, namely; Multi-Layer Perceptron (MLP), Long Short Term Memory (LSTM) and Gated Recurrent Unit (GRU). The input data used in the study was the time evolution of NC values of the folded and unfolded protein conformations computed from the equilibrated trajectories of atomistic molecular dynamics (MD) simulations performed with the aqueous solution of the protein at ambient as well as at an elevated temperature. The evolutionary prediction accuracy of the three models was tested by calculating two error parameters; Root Mean Square Error (RMSE) and Mean Absolute Error (MAE). Our study revealed that, although these three models are successful in forecasting the time evolutions of the NCs in terms of lower RMSE and MAE, the prediction through static memoryless artificial neural network, MLP was relatively less precise as compared to other two recurrent units, LSTM and GRU. The study infers that by using the available input data generated from the MD trajectories; these neural network based models could be used to predict the complex evolution pattern of distanced based structuralAbstract: Native contacts (NCs) are one of the most vital parameters in order to define the resemblance of a protein conformation with its native state. Prediction of number of native contacts in a protein is useful in protein folding mechanism. In this work, we focused to predict the time series of the number of NCs of a small protein, insulin monomer by using three neural network based models, namely; Multi-Layer Perceptron (MLP), Long Short Term Memory (LSTM) and Gated Recurrent Unit (GRU). The input data used in the study was the time evolution of NC values of the folded and unfolded protein conformations computed from the equilibrated trajectories of atomistic molecular dynamics (MD) simulations performed with the aqueous solution of the protein at ambient as well as at an elevated temperature. The evolutionary prediction accuracy of the three models was tested by calculating two error parameters; Root Mean Square Error (RMSE) and Mean Absolute Error (MAE). Our study revealed that, although these three models are successful in forecasting the time evolutions of the NCs in terms of lower RMSE and MAE, the prediction through static memoryless artificial neural network, MLP was relatively less precise as compared to other two recurrent units, LSTM and GRU. The study infers that by using the available input data generated from the MD trajectories; these neural network based models could be used to predict the complex evolution pattern of distanced based structural parameters of a protein with a satisfactory level. Graphical Abstract: ga1 Highlights: Time series forecasting of native contacts of insulin in aqueous solution at 300 K and 400 K. LSTM performs better than GRU and MLP. Time lag of 8 produces better prediction accuracy. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 97(2022)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 97(2022)
- Issue Display:
- Volume 97, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 97
- Issue:
- 2022
- Issue Sort Value:
- 2022-0097-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-04
- Subjects:
- Machine learning -- Protein -- Native contact -- Molecular simulation -- Time series prediction
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2022.107625 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21007.xml