RF-GlutarySite: a random forest based predictor for glutarylation sites. (26th April 2019)
- Record Type:
- Journal Article
- Title:
- RF-GlutarySite: a random forest based predictor for glutarylation sites. (26th April 2019)
- Main Title:
- RF-GlutarySite: a random forest based predictor for glutarylation sites
- Authors:
- AL-barakati, Hussam J.
Saigo, Hiroto
Newman, Robert H.
KC, Dukka B. - Abstract:
- Abstract : Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. Here, we describe the development of RF-GlutarySite, a random forest-based predictor designed to predict glutarylation sites based on protein primary amino acid sequence. Abstract : Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. However, the specific sites of modification on individual proteins, as well as the extent of glutarylation throughout the proteome, remain largely uncharacterized. Though informative, proteomic approaches based on mass spectrometry can be expensive, technically challenging and time-consuming. Therefore, the ability to predict glutarylation sites from protein primary sequences can complement proteomics analyses and help researchers study the characteristics and functional consequences of glutarylation. To this end, we used Random Forest (RF) machine learning strategies to identify the physiochemical and sequence-based features that correlated most substantially with glutarylation. We then used these features to develop a novel method to predict glutarylation sites from primary amino acid sequences using RF. Based on 10-fold cross-validation, the resulting algorithm, termedAbstract : Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. Here, we describe the development of RF-GlutarySite, a random forest-based predictor designed to predict glutarylation sites based on protein primary amino acid sequence. Abstract : Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. However, the specific sites of modification on individual proteins, as well as the extent of glutarylation throughout the proteome, remain largely uncharacterized. Though informative, proteomic approaches based on mass spectrometry can be expensive, technically challenging and time-consuming. Therefore, the ability to predict glutarylation sites from protein primary sequences can complement proteomics analyses and help researchers study the characteristics and functional consequences of glutarylation. To this end, we used Random Forest (RF) machine learning strategies to identify the physiochemical and sequence-based features that correlated most substantially with glutarylation. We then used these features to develop a novel method to predict glutarylation sites from primary amino acid sequences using RF. Based on 10-fold cross-validation, the resulting algorithm, termed 'RF-GlutarySite', achieved efficiency scores of 75%, 81%, 68% and 0.50 with respect to accuracy (ACC), sensitivity (SN), specificity (SP) and Matthew's correlation coefficient (MCC), respectively. Likewise, using an independent test set, RF-GlutarySite exhibited ACC, SN, SP and MCC scores of 72%, 73%, 70% and 0.43, respectively. Results using both 10-fold cross validation and an independent test set were on par with or better than those achieved by existing glutarylation site predictors. Notably, RF-GlutarySite achieved the highest SN score among available glutarylation site prediction tools. Consequently, our method has the potential to uncover new glutarylation sites and to facilitate the discovery of relationships between glutarylation and well-known lysine modifications, such as acetylation, methylation and SUMOylation, as well as a number of recently identified lysine modifications, such as malonylation and succinylation. … (more)
- Is Part Of:
- Molecular omics. Volume 15:Number 3(2019)
- Journal:
- Molecular omics
- Issue:
- Volume 15:Number 3(2019)
- Issue Display:
- Volume 15, Issue 3 (2019)
- Year:
- 2019
- Volume:
- 15
- Issue:
- 3
- Issue Sort Value:
- 2019-0015-0003-0000
- Page Start:
- 189
- Page End:
- 204
- Publication Date:
- 2019-04-26
- Subjects:
- Molecular biology -- Periodicals
Biochemistry -- Periodicals
Biological systems -- Periodicals
Molecular Biology
Computational Biology
Biochemistry
Biological systems
Molecular biology
Periodicals
Electronic journals
Periodicals
Fulltext
Internet Resources
Periodicals - Journal URLs:
- http://www.rsc.org/journals-books-databases/about-journals/molecular-omics/ ↗
http://pubs.rsc.org/en/journals/journalissues/mo#!recentarticles&adv ↗
http://www.rsc.org/ ↗ - DOI:
- 10.1039/c9mo00028c ↗
- Languages:
- English
- ISSNs:
- 2515-4184
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 9838.212612
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10843.xml