Computational identification of circular RNAs based on conformational and thermodynamic properties in the flanking introns. (April 2016)
- Record Type:
- Journal Article
- Title:
- Computational identification of circular RNAs based on conformational and thermodynamic properties in the flanking introns. (April 2016)
- Main Title:
- Computational identification of circular RNAs based on conformational and thermodynamic properties in the flanking introns
- Authors:
- Liu, Ze
Han, Jiuqiang
Lv, Hongqiang
Liu, Jun
Liu, Ruiling - Abstract:
- Graphical abstract: Highlights: A computational method was proposed to distinguish circRNAs from non-circularized, expressed exons. Thermodynamic and conformational properties were extracted for model training. Two feature selection methods were used to construct the optimized feature subset. Our method received a high performance using 10-fold cross-validation on the training dataset. Abstract: Circular RNAs (circRNAs) were found more than 30 years ago, but have been treated as molecular flukes in a long time. Combining deep sequencing studies with bioinformatics technique, thousands of endogenous circRNAs have been found in mammalian cells, and some researchers have proved that several circRNAs act as competing endogenous RNAs (ceRNAs) to regulate gene expression. However, the mechanism by which the precursor mRNA to be transformed into a circular RNA or a linear mRNA is largely unknown. In this paper, we attempted to bioinformatically identify shared genomic features that might further elucidate the mechanism of formation and proposed a SVM-based model to distinguish circRNAs from non-circularized, expressed exons. Firstly, conformational and thermodynamic dinucleotide properties in the flanking introns were extracted as potential features. Secondly, two feature selection methods were applied to gain the optimal feature subset. Our 10-fold cross-validation results showed that the model can be used to distinguish circRNAs from non-circularized, expressed exons with an SnGraphical abstract: Highlights: A computational method was proposed to distinguish circRNAs from non-circularized, expressed exons. Thermodynamic and conformational properties were extracted for model training. Two feature selection methods were used to construct the optimized feature subset. Our method received a high performance using 10-fold cross-validation on the training dataset. Abstract: Circular RNAs (circRNAs) were found more than 30 years ago, but have been treated as molecular flukes in a long time. Combining deep sequencing studies with bioinformatics technique, thousands of endogenous circRNAs have been found in mammalian cells, and some researchers have proved that several circRNAs act as competing endogenous RNAs (ceRNAs) to regulate gene expression. However, the mechanism by which the precursor mRNA to be transformed into a circular RNA or a linear mRNA is largely unknown. In this paper, we attempted to bioinformatically identify shared genomic features that might further elucidate the mechanism of formation and proposed a SVM-based model to distinguish circRNAs from non-circularized, expressed exons. Firstly, conformational and thermodynamic dinucleotide properties in the flanking introns were extracted as potential features. Secondly, two feature selection methods were applied to gain the optimal feature subset. Our 10-fold cross-validation results showed that the model can be used to distinguish circRNAs from non-circularized, expressed exons with an Sn of 0.884, Sp of 0.900, ACC of 0.892, MCC of 0.784, respectively. The identification results suggest that conformational and thermodynamic properties in the flanking introns are closely related to the formation of circRNAs. Datasets and the tool involved in this paper are all available athttps://sourceforge.net/projects/predicircrnatool/files/ . … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 61(2016)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 61(2016)
- Issue Display:
- Volume 61, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 61
- Issue:
- 2016
- Issue Sort Value:
- 2016-0061-2016-0000
- Page Start:
- 221
- Page End:
- 225
- Publication Date:
- 2016-04
- Subjects:
- Circular RNA -- Competing endogenous RNA -- Support vector machine -- 10-Fold cross-validation
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2016.02.003 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 2321.xml