Markovian encoding models in human splice site recognition using SVM. (April 2018)
- Record Type:
- Journal Article
- Title:
- Markovian encoding models in human splice site recognition using SVM. (April 2018)
- Main Title:
- Markovian encoding models in human splice site recognition using SVM
- Authors:
- Pashaei, Elham
Aydin, Nizamettin - Abstract:
- Highlights: We have provided a new precise evaluation of Markovian encoding approaches in Human splice site prediction domain. We have proposed a novel approach for human splice site recognition . The developed approach achieved higher prediction accuracy than thirteen current state-of-arts methods and several existing tools. An online server MMSVM has been developed for predicting Human splice sites. Abstract: Splice site recognition is among the most significant and challenging tasks in bioinformatics due to its key role in gene annotation. Effective prediction of splice site requires nucleotide encoding methods that reveal the characteristics of DNA sequences to provide appropriate features to serve as input of machine learning classifiers. Markovian models are the most influential encoding methods that highly used for pattern recognition in biological data. However, a direct performance comparison of these methods in splice site domain has not been assessed yet. This study compares various Markovian encoding models for splice site prediction utilizing support vector machine, as the most outstanding learning method in the domain, and conducts a new precise evaluation of Markovian approaches that corrects this limitation. Moreover, a novel sequence encoding approach based on third order Markov model (MM3) is proposed. The experimental results show that the proposed method, namely MM3-SVM, performs significantly better than thirteen best known state-of-the-art algorithms,Highlights: We have provided a new precise evaluation of Markovian encoding approaches in Human splice site prediction domain. We have proposed a novel approach for human splice site recognition . The developed approach achieved higher prediction accuracy than thirteen current state-of-arts methods and several existing tools. An online server MMSVM has been developed for predicting Human splice sites. Abstract: Splice site recognition is among the most significant and challenging tasks in bioinformatics due to its key role in gene annotation. Effective prediction of splice site requires nucleotide encoding methods that reveal the characteristics of DNA sequences to provide appropriate features to serve as input of machine learning classifiers. Markovian models are the most influential encoding methods that highly used for pattern recognition in biological data. However, a direct performance comparison of these methods in splice site domain has not been assessed yet. This study compares various Markovian encoding models for splice site prediction utilizing support vector machine, as the most outstanding learning method in the domain, and conducts a new precise evaluation of Markovian approaches that corrects this limitation. Moreover, a novel sequence encoding approach based on third order Markov model (MM3) is proposed. The experimental results show that the proposed method, namely MM3-SVM, performs significantly better than thirteen best known state-of-the-art algorithms, while tested on HS3D dataset considering several performance criteria. Further, it achieved higher prediction accuracy than several well-known tools like NNsplice, MEM, MM1, WMM, and GeneID, using an independent test set of 50 genes. We also developed MMSVM, a web tool to predict splice sites in any human sequence using the proposed approach. The MMSVM web server can be assessed at https://pashaei.shinyapps.io/mmsvm . … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 73(2018)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 73(2018)
- Issue Display:
- Volume 73, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 73
- Issue:
- 2018
- Issue Sort Value:
- 2018-0073-2018-0000
- Page Start:
- 159
- Page End:
- 170
- Publication Date:
- 2018-04
- Subjects:
- Markovian model -- Splice sites -- Machine learning -- DNA encoding method -- MMSVM
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2018.02.005 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 20965.xml