A comprehensive comparison and analysis of computational predictors for RNA N6-methyladenosine sites of Saccharomyces cerevisiae. (14th October 2019)
- Record Type:
- Journal Article
- Title:
- A comprehensive comparison and analysis of computational predictors for RNA N6-methyladenosine sites of Saccharomyces cerevisiae. (14th October 2019)
- Main Title:
- A comprehensive comparison and analysis of computational predictors for RNA N6-methyladenosine sites of Saccharomyces cerevisiae
- Authors:
- Zhu, Xiaolei
He, Jingjing
Zhao, Shihao
Tao, Wei
Xiong, Yi
Bi, Shoudong - Abstract:
- Abstract: N 6 -methyladenosine (m6A) modification, as one of the commonest post-transcription modifications in RNAs, has been reported to be highly related to many biological processes. Over the past decade, several tools for m6A sites prediction of Saccharomyces cerevisiae have been developed and are freely available online. However, the quality of predictions by these tools is difficult to quantify and compare. In this study, an independent dataset M6Atest6540 was compiled to systematically evaluate nine publicly available m6A prediction tools for S. cerevisiae . The experimental results indicate that RAM-ESVM achieved the best performance on M6Atest6540; however, most models performed substantially worse than their performances reported in the original papers. The benchmark dataset Met2614, which was used as the training dataset for the nine methods, were further analyzed by using a position bias index. The results demonstrated the significantly different bias of dataset Met2614 compared with the RNA segments around m6A sites recorded in RMBase. Moreover, newMet2614 was collected by randomly selecting RNA segments from non-redundant data recorded in RMBase, and three different kinds of features were extracted. The performances of the models built on Met2614 and newMet2614 with the features were compared, which shows the better generalization of models built on newMet2614. Our results also indicate the position-specific propensity-based features outperform other features,Abstract: N 6 -methyladenosine (m6A) modification, as one of the commonest post-transcription modifications in RNAs, has been reported to be highly related to many biological processes. Over the past decade, several tools for m6A sites prediction of Saccharomyces cerevisiae have been developed and are freely available online. However, the quality of predictions by these tools is difficult to quantify and compare. In this study, an independent dataset M6Atest6540 was compiled to systematically evaluate nine publicly available m6A prediction tools for S. cerevisiae . The experimental results indicate that RAM-ESVM achieved the best performance on M6Atest6540; however, most models performed substantially worse than their performances reported in the original papers. The benchmark dataset Met2614, which was used as the training dataset for the nine methods, were further analyzed by using a position bias index. The results demonstrated the significantly different bias of dataset Met2614 compared with the RNA segments around m6A sites recorded in RMBase. Moreover, newMet2614 was collected by randomly selecting RNA segments from non-redundant data recorded in RMBase, and three different kinds of features were extracted. The performances of the models built on Met2614 and newMet2614 with the features were compared, which shows the better generalization of models built on newMet2614. Our results also indicate the position-specific propensity-based features outperform other features, although they are also easily over-fitted on a biased dataset. … (more)
- Is Part Of:
- Briefings in functional genomics. Volume 18:Number 6(2019)
- Journal:
- Briefings in functional genomics
- Issue:
- Volume 18:Number 6(2019)
- Issue Display:
- Volume 18, Issue 6 (2019)
- Year:
- 2019
- Volume:
- 18
- Issue:
- 6
- Issue Sort Value:
- 2019-0018-0006-0000
- Page Start:
- 367
- Page End:
- 376
- Publication Date:
- 2019-10-14
- Subjects:
- N6-methyladenosine sites -- computational predictor -- dataset bias -- position-specific propensity -- web servers
Genomics -- Methodology -- Periodicals
Genomics -- Technological innovations -- Periodicals
572.86072 - Journal URLs:
- http://bfg.oxfordjournals.org ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/bfgp/elz018 ↗
- Languages:
- English
- ISSNs:
- 2041-2649
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958366
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12541.xml