"iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition. Issue 128 (May 2016)
- Record Type:
- Journal Article
- Title:
- "iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition. Issue 128 (May 2016)
- Main Title:
- "iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition
- Authors:
- Iqbal, Muhammad
Hayat, Maqsood
- Abstract:
- Highlights: The "iSS-Hyb-mRMR" model is proposed for identification of splicing sites. Trinucleotide and tetranucleotide composition are used as feature extraction schemes. A hybrid space is formed from the TNC and TetraNC spaces. Various classification algorithms are analyzed. mRMR is used to reduce the feature space.
Abstract: Background and objectives: Gene splicing is a vital source of protein diversity. Precise removal of introns and joining of exons is a prominent task in eukaryotic gene expression, as exons are usually interrupted by introns. Identification of splicing sites through experimental techniques is a complicated and time-consuming task. With the avalanche of genome sequences generated in the post-genomic age, it remains a complicated and challenging task to develop an automatic, robust and reliable computational method for fast and effective identification of splicing sites. Methods: In this study, a hybrid model, "iSS-Hyb-mRMR", is proposed for quick and accurate identification of splicing sites. Two sample representation methods, namely pseudo trinucleotide composition (PseTNC) and pseudo tetranucleotide composition (PseTetraNC), were used to extract numerical descriptors from DNA sequences. The hybrid model was developed by concatenating PseTNC and PseTetraNC. To select highly discriminative features, the minimum redundancy maximum relevance (mRMR) algorithm was applied to the hybrid feature space. The performance of these feature representation methods was tested using various classification algorithms, including K-nearest neighbor, probabilistic neural network, general regression neural network, and fitting network. The jackknife test was used to evaluate performance on two benchmark datasets, S1 and S2. Results: The predictor proposed in the current study achieved an accuracy of 93.26%, sensitivity of 88.77%, and specificity of 97.78% for S1, and an accuracy of 94.12%, sensitivity of 87.14%, and specificity of 98.64% for S2. Conclusion: It is observed that the performance of the proposed model is higher than that of the existing methods in the literature so far, and it will be useful for studying the mechanism of RNA splicing and for other academic research.
- Is Part Of:
- Computer methods and programs in biomedicine. Issue 128(2016)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Issue 128(2016)
- Issue Display:
- Volume 128, Issue 128 (2016)
- Year:
- 2016
- Volume:
- 128
- Issue:
- 128
- Issue Sort Value:
- 2016-0128-0128-0000
- Page Start:
- 1
- Page End:
- 11
- Publication Date:
- 2016-05
- Subjects:
- Splicing sites -- PseTNC -- PseTetraNC -- KNN -- mRMR
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28
- Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.cmpb.2016.02.006
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 556.xml
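
The Methods portion of the abstract describes concatenating trinucleotide and tetranucleotide descriptors into a hybrid feature space and evaluating a classifier with the jackknife (leave-one-out) test. The following is a minimal sketch of that general idea, not the authors' implementation. It uses plain k-mer frequencies rather than the pseudo components of PseTNC and PseTetraNC, it omits the mRMR selection step, and it substitutes a 1-nearest-neighbor rule for the classifiers named in the abstract. All function names are hypothetical and chosen only for illustration.

```python
from itertools import product

ALPHABET = "ACGT"


def kmer_composition(seq, k):
    """Return the normalized frequency of every k-mer over ACGT (4**k values)."""
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]


def hybrid_features(seq):
    """Concatenate tri- and tetranucleotide compositions (64 + 256 values)."""
    return kmer_composition(seq, 3) + kmer_composition(seq, 4)


def nearest_label(query, samples):
    """1-nearest-neighbor prediction using squared Euclidean distance."""
    best = min(
        samples,
        key=lambda s: sum((a - b) ** 2 for a, b in zip(query, s[0])),
    )
    return best[1]


def jackknife_accuracy(sequences, labels):
    """Leave-one-out evaluation: each sample is predicted from all the others."""
    feats = [hybrid_features(s) for s in sequences]
    correct = 0
    for i, (f, y) in enumerate(zip(feats, labels)):
        rest = [(feats[j], labels[j]) for j in range(len(feats)) if j != i]
        if nearest_label(f, rest) == y:
            correct += 1
    return correct / len(feats)
```

Calling `jackknife_accuracy(sequences, labels)` on a list of DNA strings and a parallel list of class labels returns the fraction of held-out samples classified correctly. A feature selection step such as mRMR would sit between `hybrid_features` and the classifier; it is left out here to keep the sketch short.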