Deep-RBPPred: Predicting RNA binding proteins in the proteome scale based on deep learning. Issue 1 (December 2018)
- Record Type:
- Journal Article
- Title:
- Deep-RBPPred: Predicting RNA binding proteins in the proteome scale based on deep learning. Issue 1 (December 2018)
- Main Title:
- Deep-RBPPred: Predicting RNA binding proteins in the proteome scale based on deep learning
- Authors:
- Zheng, Jinfang
Zhang, Xiaoli
Zhao, Xunyi
Tong, Xiaoxue
Hong, Xu
Xie, Juan
Liu, Shiyong - Abstract:
- Abstract RNA binding protein (RBP) plays an important role in cellular processes. Identifying RBPs by computation and experiment are both essential. Recently, an RBP predictor, RBPPred, is proposed in our group to predict RBPs. However, RBPPred is too slow for that it needs to generate PSSM matrix as its feature. Herein, based on the protein feature of RBPPred and Convolutional Neural Network (CNN), we develop a deep learning model called Deep-RBPPred. With the balance and imbalance training set, we obtain Deep-RBPPred-balance and Deep-RBPPred-imbalance models. Deep-RBPPred has three advantages comparing to previous methods. (1) Deep-RBPPred only needs few physicochemical properties based on protein sequences. (2) Deep-RBPPred runs much faster. (3) Deep-RBPPred has a good generalization ability. In the meantime, Deep-RBPPred is still as good as the state-of-the-art method. Testing in A. thaliana, S. cerevisiae and H. sapiens proteomes, MCC values are 0.82 (0.82), 0.65 (0.69) and 0.85 (0.80) for balance model (imbalance model) when the score cutoff is set to 0.5, respectively. In the same testing dataset, different machine learning algorithms (CNN and SVM) are also compared. The results show that CNN-based model can identify more RBPs than SVM-based. In comparing the balance and imbalance model, both CNN-base and SVM-based tend to favor the majority class in the imbalance set. Deep-RBPPred forecasts 280 (balance model) and 265 (imbalance model) of 299 new RBP. The sensitivityAbstract RNA binding protein (RBP) plays an important role in cellular processes. Identifying RBPs by computation and experiment are both essential. Recently, an RBP predictor, RBPPred, is proposed in our group to predict RBPs. However, RBPPred is too slow for that it needs to generate PSSM matrix as its feature. Herein, based on the protein feature of RBPPred and Convolutional Neural Network (CNN), we develop a deep learning model called Deep-RBPPred. With the balance and imbalance training set, we obtain Deep-RBPPred-balance and Deep-RBPPred-imbalance models. Deep-RBPPred has three advantages comparing to previous methods. (1) Deep-RBPPred only needs few physicochemical properties based on protein sequences. (2) Deep-RBPPred runs much faster. (3) Deep-RBPPred has a good generalization ability. In the meantime, Deep-RBPPred is still as good as the state-of-the-art method. Testing in A. thaliana, S. cerevisiae and H. sapiens proteomes, MCC values are 0.82 (0.82), 0.65 (0.69) and 0.85 (0.80) for balance model (imbalance model) when the score cutoff is set to 0.5, respectively. In the same testing dataset, different machine learning algorithms (CNN and SVM) are also compared. The results show that CNN-based model can identify more RBPs than SVM-based. In comparing the balance and imbalance model, both CNN-base and SVM-based tend to favor the majority class in the imbalance set. Deep-RBPPred forecasts 280 (balance model) and 265 (imbalance model) of 299 new RBP. The sensitivity of balance model is about 7% higher than the state-of-the-art method. We also apply deep-RBPPred to 30 eukaryotes and 109 bacteria proteomes downloaded from Uniprot to estimate all possible RBPs. The estimating result shows that rates of RBPs in eukaryote proteomes are much higher than bacteria proteomes. … (more)
- Is Part Of:
- Scientific reports. Volume 8:Issue 1(2018)
- Journal:
- Scientific reports
- Issue:
- Volume 8:Issue 1(2018)
- Issue Display:
- Volume 8, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 8
- Issue:
- 1
- Issue Sort Value:
- 2018-0008-0001-0000
- Page Start:
- 1
- Page End:
- 9
- Publication Date:
- 2018-12
- Subjects:
- Natural history -- Research -- Periodicals
Biology -- Research -- Periodicals
Physical sciences -- Research -- Periodicals
Earth sciences -- Research -- Periodicals
Environmental sciences -- Research -- Periodicals
502.85 - Journal URLs:
- http://www.nature.com/ ↗
http://www.nature.com/srep/index.html ↗ - DOI:
- 10.1038/s41598-018-33654-x ↗
- Languages:
- English
- ISSNs:
- 2045-2322
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10715.xml