Prediction of protein-protein interactions based on ensemble residual convolutional neural network. (January 2023)
- Record Type:
- Journal Article
- Title:
- Prediction of protein-protein interactions based on ensemble residual convolutional neural network. (January 2023)
- Main Title:
- Prediction of protein-protein interactions based on ensemble residual convolutional neural network
- Authors:
- Gao, Hongli
Chen, Cheng
Li, Shuangyi
Wang, Congjing
Zhou, Weifeng
Yu, Bin - Abstract:
- Abstract: With the development of bio-medical big data, the prediction of protein-protein interactions (PPIs) with the help of deep learning (DL) has attracted much attention for the study of intermolecular mechanism, drug design, human disease treatment. Given that the experiment-based methods can be difficult, reliable and DL-based approaches are needed. In this paper, we develop the EResCNN, an effective predictor to predict PPIs based on ensemble residual convolutional neural network. First, the fused feature representation is captured by concatenating the vectors obtained via pseudo amino acid composition (PseAAC), auto covariance descriptor (AC), pseudo position-specific scoring matrix (PsePSSM), encoding based on grouped weight (EBGW), multivariate mutual information (MMI) and conjoint triad (CT). Then the high-level information can be obtained using the convolution and pooling of the residual convolutional neural network (RCNN) via layer-by-layer learning. At last, we ensemble RCNN, XGBoost, random forest, LightGBM and extremely randomized trees to build the EResCNN model. The predictive results indicate that EResCNN achieves better performance with the ACC values of 95.34%, 87.89% and 98.61% on S. cerevisiae, H. pylori and Human-Y. pestis datasets, respectively. We also apply EResCNN to the datasets of H. sapiens, M. musculus, C. elegans and E. coli for cross-species prediction. Especially, we find EResCNN can infer significant PPIs network on one-core network,Abstract: With the development of bio-medical big data, the prediction of protein-protein interactions (PPIs) with the help of deep learning (DL) has attracted much attention for the study of intermolecular mechanism, drug design, human disease treatment. Given that the experiment-based methods can be difficult, reliable and DL-based approaches are needed. In this paper, we develop the EResCNN, an effective predictor to predict PPIs based on ensemble residual convolutional neural network. First, the fused feature representation is captured by concatenating the vectors obtained via pseudo amino acid composition (PseAAC), auto covariance descriptor (AC), pseudo position-specific scoring matrix (PsePSSM), encoding based on grouped weight (EBGW), multivariate mutual information (MMI) and conjoint triad (CT). Then the high-level information can be obtained using the convolution and pooling of the residual convolutional neural network (RCNN) via layer-by-layer learning. At last, we ensemble RCNN, XGBoost, random forest, LightGBM and extremely randomized trees to build the EResCNN model. The predictive results indicate that EResCNN achieves better performance with the ACC values of 95.34%, 87.89% and 98.61% on S. cerevisiae, H. pylori and Human-Y. pestis datasets, respectively. We also apply EResCNN to the datasets of H. sapiens, M. musculus, C. elegans and E. coli for cross-species prediction. Especially, we find EResCNN can infer significant PPIs network on one-core network, Wnt-related signal pathway network, cancer-specific network and multi-core network, which could provide some references for signal pathway research, disease-related gene mining, and interaction network topology. Highlights: A novel method (EResCNN) is proposed to predict protein-protein interactions. Fusing the PseAAC, AC, PsePSSM, EBGW, MMI and CT methods to extract feature information. Residual convolutional neural network is employed to mine high-level features of PPIs. We combined RCNN with XGBoost, LightGBM, random forest and extreme random tree for ensemble learning. EResCNN has good generalization ability on independent test sets and PPIs network datasets. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 152(2023)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 152(2023)
- Issue Display:
- Volume 152, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 152
- Issue:
- 2023
- Issue Sort Value:
- 2023-0152-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-01
- Subjects:
- Protein-protein interactions -- Multi-information fusion -- Residual convolutional neural network -- Ensemble learning
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2022.106471 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24845.xml