DNN-DTIs: Improved drug-target interactions prediction using XGBoost feature selection and deep neural network. (September 2021)
- Record Type:
- Journal Article
- Title:
- DNN-DTIs: Improved drug-target interactions prediction using XGBoost feature selection and deep neural network. (September 2021)
- Main Title:
- DNN-DTIs: Improved drug-target interactions prediction using XGBoost feature selection and deep neural network
- Authors:
- Chen, Cheng
Shi, Han
Jiang, Zhiwen
Salhi, Adil
Chen, Ruixin
Cui, Xuefeng
Yu, Bin - Abstract:
- Abstract: Analysis and prediction of drug-target interactions (DTIs) play an important role in understanding drug mechanisms, as well as drug repositioning and design. Machine learning (ML)-based methods for DTIs prediction can mitigate the shortcomings of time-consuming and labor-intensive experimental approaches, while providing new ideas and insights for drug design. We propose a novel pipeline for predicting drug-target interactions, called DNN-DTIs. First, the target information is characterized by a number of features, namely, pseudo-amino acid composition, pseudo position-specific scoring matrix, conjoint triad composition, transition and distribution, Moreau-Broto autocorrelation, and structural features. The drug compounds are subsequently encoded using substructure fingerprints. Next, eXtreme gradient boosting (XGBoost) is used to determine the subset of non-redundant features of importance. The optimal balanced set of sample vectors is obtained by applying the synthetic minority oversampling technique (SMOTE). Finally, a DTIs predictor, DNN-DTIs, is developed based on a deep neural network (DNN) via a layer-by-layer learning scheme. Experimental results indicate that DNN-DTIs achieves better performance than other state-of-the-art predictors with ACC values of 98.78%, 98.60%, 97.98%, 98.24% and 98.00% on Enzyme, Ion Channels (IC), GPCR, Nuclear Receptors (NR) and Kuang's datasets. Therefore, the accurate prediction performance of DNN-DTIs makes it a favored choiceAbstract: Analysis and prediction of drug-target interactions (DTIs) play an important role in understanding drug mechanisms, as well as drug repositioning and design. Machine learning (ML)-based methods for DTIs prediction can mitigate the shortcomings of time-consuming and labor-intensive experimental approaches, while providing new ideas and insights for drug design. We propose a novel pipeline for predicting drug-target interactions, called DNN-DTIs. First, the target information is characterized by a number of features, namely, pseudo-amino acid composition, pseudo position-specific scoring matrix, conjoint triad composition, transition and distribution, Moreau-Broto autocorrelation, and structural features. The drug compounds are subsequently encoded using substructure fingerprints. Next, eXtreme gradient boosting (XGBoost) is used to determine the subset of non-redundant features of importance. The optimal balanced set of sample vectors is obtained by applying the synthetic minority oversampling technique (SMOTE). Finally, a DTIs predictor, DNN-DTIs, is developed based on a deep neural network (DNN) via a layer-by-layer learning scheme. Experimental results indicate that DNN-DTIs achieves better performance than other state-of-the-art predictors with ACC values of 98.78%, 98.60%, 97.98%, 98.24% and 98.00% on Enzyme, Ion Channels (IC), GPCR, Nuclear Receptors (NR) and Kuang's datasets. Therefore, the accurate prediction performance of DNN-DTIs makes it a favored choice for contributing to the study of DTIs, especially drug repositioning. Highlights: A novel pipeline called DNN-DTIs for drug-target interactions prediction. We firstly perform multi-information fusion via PseAAC, PsePSSM, CTD, CT, NMBroto, structure feature and molecular fingerprint. The XGBoost is firstly employed to determine optimal features for DTIs prediction. We use deep neural network as classifier to predict DTIs via layer-by-layer learning, which can obtain high-level features. DNN-DTIs model has good generalization ability on DTIs network prediction. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 136(2021)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 136(2021)
- Issue Display:
- Volume 136, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 136
- Issue:
- 2021
- Issue Sort Value:
- 2021-0136-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-09
- Subjects:
- Drug-target interactions -- Multi-information fusion -- XGBoost -- Deep neural network
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2021.104676 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18637.xml