A representation transfer learning approach for enhanced prediction of growth hormone binding proteins. (August 2020)
- Record Type:
- Journal Article
- Title:
- A representation transfer learning approach for enhanced prediction of growth hormone binding proteins. (August 2020)
- Main Title:
- A representation transfer learning approach for enhanced prediction of growth hormone binding proteins
- Authors:
- Yadav, Amisha
Sahu, Roopshikha
Nath, Abhigyan - Abstract:
- Graphical abstract: Highlights: Application of representation transfer learning. Enhanced prediction accuracy for GHBPs. Compressed and informative representation of the input data. Comparison of different types of feature representation. Abstract: Growth hormone binding proteins (GHBPs) are soluble proteins that play an important role in the modulation of signaling pathways pertaining to growth hormones. GHBPs are selective and bind non-covalently with growth hormones, but their functions are still not fully understood. Identification and characterization of GHBPs are the preliminary steps for understanding their roles in various cellular processes. As wet lab based experimental methods involve high cost and labor, computational methods can facilitate in narrowing down the search space of putative GHBPs. Performance of machine learning algorithms largely depends on the quality of features that it feeds on. Informative and non-redundant features generally result in enhanced performance and for this purpose feature selection algorithms are commonly used. In the present work, a novel representation transfer learning approach is presented for prediction of GHBPs. For their accurate prediction, deep autoencoder based features were extracted and subsequently SMO-PolyK classifier is trained. The prediction model is evaluated by both leave one out cross validation (LOOCV) and hold out independent testing set. On LOOCV, the prediction model achieved 89.8%% accuracy, with 89.4%Graphical abstract: Highlights: Application of representation transfer learning. Enhanced prediction accuracy for GHBPs. Compressed and informative representation of the input data. Comparison of different types of feature representation. Abstract: Growth hormone binding proteins (GHBPs) are soluble proteins that play an important role in the modulation of signaling pathways pertaining to growth hormones. GHBPs are selective and bind non-covalently with growth hormones, but their functions are still not fully understood. Identification and characterization of GHBPs are the preliminary steps for understanding their roles in various cellular processes. As wet lab based experimental methods involve high cost and labor, computational methods can facilitate in narrowing down the search space of putative GHBPs. Performance of machine learning algorithms largely depends on the quality of features that it feeds on. Informative and non-redundant features generally result in enhanced performance and for this purpose feature selection algorithms are commonly used. In the present work, a novel representation transfer learning approach is presented for prediction of GHBPs. For their accurate prediction, deep autoencoder based features were extracted and subsequently SMO-PolyK classifier is trained. The prediction model is evaluated by both leave one out cross validation (LOOCV) and hold out independent testing set. On LOOCV, the prediction model achieved 89.8%% accuracy, with 89.4% sensitivity and 90.2% specificity and accuracy of 93.5%, sensitivity of 90.2% and specificity of 96.8% is attained on the hold out testing set. Further a comparison was made between the full set of sequence-based features, top performing sequence features extracted using feature selection algorithm, deep autoencoder based features and generalized low rank model based features on the prediction accuracy. Principal component analysis of the representative features along with t-sne visualization demonstrated the effectiveness of deep features in prediction of GHBPs. The present method is robust and accurate and may complement other wet lab based methods for identification of novel GHBPs. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 87(2020)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 87(2020)
- Issue Display:
- Volume 87, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 87
- Issue:
- 2020
- Issue Sort Value:
- 2020-0087-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08
- Subjects:
- Growth hormone binding proteins -- Autoencoders -- Feature selection -- SMO-PolyK -- Generalized low rank models -- Principal component analysis -- t-sne
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2020.107274 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 13572.xml