Refinement of pairwise potentials via logistic regression to score protein‐protein interactions. Issue 12 (30th July 2020)
- Record Type:
- Journal Article
- Title:
- Refinement of pairwise potentials via logistic regression to score protein‐protein interactions. Issue 12 (30th July 2020)
- Main Title:
- Refinement of pairwise potentials via logistic regression to score protein‐protein interactions
- Authors:
- Tanemura, Kiyoto A.
Pei, Jun
Merz, Kenneth M. - Abstract:
- Abstract: Protein‐protein interactions (PPIs) are ubiquitous and functionally of great importance in biological systems. Hence, the accurate prediction of PPIs by protein‐protein docking and scoring tools is highly desirable in order to characterize their structure and biological function. Ab initio docking protocols are divided into the sampling of docking poses to produce at least one near‐native structure, and then to evaluate the vast candidate structures by scoring. Concurrent development in both sampling and scoring is crucial for the deployment of protein‐protein docking software. In the present work, we apply a machine learning model on pairwise potentials to refine the task of protein quaternary structure native structure detection among decoys. A decoy set was featurized using the Knowledge and Empirical Combined Scoring Algorithm 2 (KECSA2) pairwise potential. The highly unbalanced decoy set was then balanced using a comparison concept between native and decoy structures. The resultant comparison descriptors were used to train a logistic regression (LR) classifier. The LR model yielded the optimal performance for native detection among decoys compared with conventional scoring functions, while exhibiting lesser performance for the detection of low root mean square deviation decoy structures. Its deployment on an independent benchmark set confirms that the scoring function performs competitively relative to other scoring functions. The scripts used are available atAbstract: Protein‐protein interactions (PPIs) are ubiquitous and functionally of great importance in biological systems. Hence, the accurate prediction of PPIs by protein‐protein docking and scoring tools is highly desirable in order to characterize their structure and biological function. Ab initio docking protocols are divided into the sampling of docking poses to produce at least one near‐native structure, and then to evaluate the vast candidate structures by scoring. Concurrent development in both sampling and scoring is crucial for the deployment of protein‐protein docking software. In the present work, we apply a machine learning model on pairwise potentials to refine the task of protein quaternary structure native structure detection among decoys. A decoy set was featurized using the Knowledge and Empirical Combined Scoring Algorithm 2 (KECSA2) pairwise potential. The highly unbalanced decoy set was then balanced using a comparison concept between native and decoy structures. The resultant comparison descriptors were used to train a logistic regression (LR) classifier. The LR model yielded the optimal performance for native detection among decoys compared with conventional scoring functions, while exhibiting lesser performance for the detection of low root mean square deviation decoy structures. Its deployment on an independent benchmark set confirms that the scoring function performs competitively relative to other scoring functions. The scripts used are available at https://github.com/TanemuraKiyoto/PPI-native-detection-via-LR . … (more)
- Is Part Of:
- Proteins. Volume 88:Issue 12(2020)
- Journal:
- Proteins
- Issue:
- Volume 88:Issue 12(2020)
- Issue Display:
- Volume 88, Issue 12 (2020)
- Year:
- 2020
- Volume:
- 88
- Issue:
- 12
- Issue Sort Value:
- 2020-0088-0012-0000
- Page Start:
- 1559
- Page End:
- 1568
- Publication Date:
- 2020-07-30
- Subjects:
- docking refinement -- machine learning -- protein‐protein docking -- quaternary structure prediction -- scoring function
Proteins -- Periodicals
Proteins -- Periodicals
572.6 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/prot.25973 ↗
- Languages:
- English
- ISSNs:
- 0887-3585
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6936.164000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14704.xml