THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites. Issue 11 (15th June 2022)
- Record Type:
- Journal Article
- Title:
- THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites. Issue 11 (15th June 2022)
- Main Title:
- THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
- Authors:
- Shoombuatong, Watshara
Basith, Shaherin
Pitti, Thejkiran
Lee, Gwang
Manavalan, Balachandran - Abstract:
- Graphical abstract: Highlights: THRONE is a novel three-layer ensemble method for accurate prediction of human m7G sites from RNA sequence information. Expl Exploring different computational frameworks using nine different encodings and six different machine learning classifiers. Extensive benchmarking test shows that THRONE achieves superior performance in m7G site prediction compared to the publicly available state-of-the-art predictors. Abstract: N 7 -methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A t hree-layer ensemble predictor for identifying h uman R NA N7-methylguano sin e site s) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were createdGraphical abstract: Highlights: THRONE is a novel three-layer ensemble method for accurate prediction of human m7G sites from RNA sequence information. Expl Exploring different computational frameworks using nine different encodings and six different machine learning classifiers. Extensive benchmarking test shows that THRONE achieves superior performance in m7G site prediction compared to the publicly available state-of-the-art predictors. Abstract: N 7 -methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A t hree-layer ensemble predictor for identifying h uman R NA N7-methylguano sin e site s) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis. … (more)
- Is Part Of:
- Journal of molecular biology. Volume 434:Issue 11(2022)
- Journal:
- Journal of molecular biology
- Issue:
- Volume 434:Issue 11(2022)
- Issue Display:
- Volume 434, Issue 11 (2022)
- Year:
- 2022
- Volume:
- 434
- Issue:
- 11
- Issue Sort Value:
- 2022-0434-0011-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06-15
- Subjects:
- RNA N7-methylguanosine sites -- sequence analysis -- bioinformatics -- ensemble learning -- machine learning
Molecular biology -- Periodicals
Biology -- Periodicals
Biochemistry -- Periodicals
Bacteriology -- Periodicals
Molecular Biology -- Periodicals
Biochemistry -- Periodicals
Biologie moléculaire -- Périodiques
Biologie -- Périodiques
Biochimie -- Périodiques
Moleculaire biologie
Biochemistry
Biology
Molecular biology
Periodicals
572.805 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00222836 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.jmb.2022.167549 ↗
- Languages:
- English
- ISSNs:
- 0022-2836
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5020.700000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21756.xml