A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties. Issue 22 (28th March 2018)
- Record Type:
- Journal Article
- Title:
- A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties. Issue 22 (28th March 2018)
- Main Title:
- A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
- Authors:
- Kundu, Indra
Paul, Goutam
Banerjee, Raja - Abstract:
- Abstract : There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. Abstract : There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. We have tried to implement Machine Learning (ML) algorithm models on the protein–ligand binding affinity data already available to predict the binding affinity of the unknown. ML methods are appreciably faster and cheaper as compared to traditional experimental methods or computational scoring approaches. The prerequisites of this prediction are sufficient and unbiased features of training data and a prediction model which can fit the data well. In our study, we have applied Random forest and Gaussian process regression algorithms from the Weka package on protein–ligand binding affinity, which encompasses protein and ligand binding information from PdbBind database. The models are trained on the basis of selective fundamental information of both proteins and ligand, which can be effortlessly fetched from online databases or can be calculated with the availability of structure. The assessment of the models was made on the basis of correlation coefficient ( R 2 ) and root mean square error (RMSE). The Random forest model gave R 2 and RMSE of 0.76 and 1.31 respectively. We have also used our features and prediction models on the dataset used by others and found that our modelAbstract : There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. Abstract : There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. We have tried to implement Machine Learning (ML) algorithm models on the protein–ligand binding affinity data already available to predict the binding affinity of the unknown. ML methods are appreciably faster and cheaper as compared to traditional experimental methods or computational scoring approaches. The prerequisites of this prediction are sufficient and unbiased features of training data and a prediction model which can fit the data well. In our study, we have applied Random forest and Gaussian process regression algorithms from the Weka package on protein–ligand binding affinity, which encompasses protein and ligand binding information from PdbBind database. The models are trained on the basis of selective fundamental information of both proteins and ligand, which can be effortlessly fetched from online databases or can be calculated with the availability of structure. The assessment of the models was made on the basis of correlation coefficient ( R 2 ) and root mean square error (RMSE). The Random forest model gave R 2 and RMSE of 0.76 and 1.31 respectively. We have also used our features and prediction models on the dataset used by others and found that our model with our features outperformed the existing ones. … (more)
- Is Part Of:
- RSC advances. Volume 8:Issue 22(2018)
- Journal:
- RSC advances
- Issue:
- Volume 8:Issue 22(2018)
- Issue Display:
- Volume 8, Issue 22 (2018)
- Year:
- 2018
- Volume:
- 8
- Issue:
- 22
- Issue Sort Value:
- 2018-0008-0022-0000
- Page Start:
- 12127
- Page End:
- 12137
- Publication Date:
- 2018-03-28
- Subjects:
- Chemistry -- Periodicals
540.5 - Journal URLs:
- http://pubs.rsc.org/en/Journals/JournalIssues/RA ↗
http://www.rsc.org/ ↗ - DOI:
- 10.1039/c8ra00003d ↗
- Languages:
- English
- ISSNs:
- 2046-2069
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8036.750300
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 6153.xml