DiSBPred: A machine learning based approach for disulfide bond prediction. (April 2021)
- Record Type:
- Journal Article
- Title:
- DiSBPred: A machine learning based approach for disulfide bond prediction. (April 2021)
- Main Title:
- DiSBPred: A machine learning based approach for disulfide bond prediction
- Authors:
- Mishra, Avdesh
Kabir, Md Wasi Ul
Hoque, Md Tamjidul - Abstract:
- Graphical abstract: The workflow of the disulfide bond prediction. Highlights: Highly accurate disulfide bond prediction from sequence alone. Showcase disulfide bond prediction. Stacking based robust machine learning algorithm. Comparisons of machine learning algorithms for disulfide bond prediction. Abstract: The protein disulfide bond is a covalent bond that forms during post-translational modification by the oxidation of a pair of cysteines. In protein, the disulfide bond is the most frequent covalent link between amino acids after the peptide bond. It plays a significant role in three-dimensional (3D) ab initio protein structure prediction ( aiPSP ), stabilizing protein conformation, post-translational modification, and protein folding. In aiPSP, the location of disulfide bonds can strongly reduce the conformational space searching by imposing geometrical constraints. Existing experimental techniques for the determination of disulfide bonds are time-consuming and expensive. Thus, developing sequence-based computational methods for disulfide bond prediction becomes indispensable. This study proposed a stacking-based machine learning approach for disulfide bond prediction ( diSBPred ). Various useful sequence and structure-based features are extracted for effective training, including conservation profile, residue solvent accessibility, torsion angle flexibility, disorder probability, a sequential distance between cysteines, and more. The prediction of disulfide bonds isGraphical abstract: The workflow of the disulfide bond prediction. Highlights: Highly accurate disulfide bond prediction from sequence alone. Showcase disulfide bond prediction. Stacking based robust machine learning algorithm. Comparisons of machine learning algorithms for disulfide bond prediction. Abstract: The protein disulfide bond is a covalent bond that forms during post-translational modification by the oxidation of a pair of cysteines. In protein, the disulfide bond is the most frequent covalent link between amino acids after the peptide bond. It plays a significant role in three-dimensional (3D) ab initio protein structure prediction ( aiPSP ), stabilizing protein conformation, post-translational modification, and protein folding. In aiPSP, the location of disulfide bonds can strongly reduce the conformational space searching by imposing geometrical constraints. Existing experimental techniques for the determination of disulfide bonds are time-consuming and expensive. Thus, developing sequence-based computational methods for disulfide bond prediction becomes indispensable. This study proposed a stacking-based machine learning approach for disulfide bond prediction ( diSBPred ). Various useful sequence and structure-based features are extracted for effective training, including conservation profile, residue solvent accessibility, torsion angle flexibility, disorder probability, a sequential distance between cysteines, and more. The prediction of disulfide bonds is carried out in two stages: first, individual cysteines are predicted as either bonding or non-bonding; second, the cysteine-pairs are predicted as either bonding or non-bonding by including the results from cysteine bonding prediction as a feature. The examination of the relevance of the features employed in this study and the features utilized in the existing nearest neighbor algorithm (NNA) method shows that the features used in this study improve about 7.39 % in jackknife validation balanced accuracy. Moreover, for individual cysteine bonding prediction and cysteine-pair bonding prediction, diSBPred provides a 10-fold cross-validation balanced accuracy of 82.29 % and 94.20 %, respectively. Altogether, our predictor achieves an improvement of 43.25 % based on balanced accuracy compared to the existing NNA based approach. Thus, diSBPred can be utilized to annotate the cysteine bonding residues of protein sequences whose structures are unknown as well as improve the accuracy of the aiPSP method, which can further aid in experimental studies of the disulfide bond and structure determination. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 91(2021)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 91(2021)
- Issue Display:
- Volume 91, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 91
- Issue:
- 2021
- Issue Sort Value:
- 2021-0091-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-04
- Subjects:
- Machine learning -- Disulfide bond prediction -- Protein structure -- Protein sequence
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2021.107436 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 16188.xml