An improved poly(A) motifs recognition method based on decision level fusion. (February 2015)
- Record Type:
- Journal Article
- Title:
- An improved poly(A) motifs recognition method based on decision level fusion. (February 2015)
- Main Title:
- An improved poly(A) motifs recognition method based on decision level fusion
- Authors:
- Zhang, Shanxin
Han, Jiuqiang
Liu, Jun
Zheng, Jiguang
Liu, Ruiling - Abstract:
- Graphical abstract: Highlights: Proposed a novel comparable PCA and SVM based poly(A) motif recognition method. Proposed a novel DS evidence theory based recognition method with higher accuracy. Proposed a novel basic probability assignment method used for DS evidence theory. Abstract: Polyadenylation is the process of addition of poly(A) tail to mRNA 3′ ends. Identification of motifs controlling polyadenylation plays an essential role in improving genome annotation accuracy and better understanding of the mechanisms governing gene regulation. The bioinformatics methods used for poly(A) motifs recognition have demonstrated that information extracted from sequences surrounding the candidate motifs can differentiate true motifs from the false ones greatly. However, these methods depend on either domain features or string kernels. To date, methods combining information from different sources have not been found yet. Here, we proposed an improved poly(A) motifs recognition method by combing different sources based on decision level fusion. First of all, two novel prediction methods was proposed based on support vector machine (SVM): one method is achieved by using the domain-specific features and principle component analysis (PCA) method to eliminate the redundancy (PCA–SVM); the other method is based on Oligo string kernel (Oligo-SVM). Then we proposed a novel machine-learning method for poly(A) motif prediction by marrying four poly(A) motifs recognition methods, including twoGraphical abstract: Highlights: Proposed a novel comparable PCA and SVM based poly(A) motif recognition method. Proposed a novel DS evidence theory based recognition method with higher accuracy. Proposed a novel basic probability assignment method used for DS evidence theory. Abstract: Polyadenylation is the process of addition of poly(A) tail to mRNA 3′ ends. Identification of motifs controlling polyadenylation plays an essential role in improving genome annotation accuracy and better understanding of the mechanisms governing gene regulation. The bioinformatics methods used for poly(A) motifs recognition have demonstrated that information extracted from sequences surrounding the candidate motifs can differentiate true motifs from the false ones greatly. However, these methods depend on either domain features or string kernels. To date, methods combining information from different sources have not been found yet. Here, we proposed an improved poly(A) motifs recognition method by combing different sources based on decision level fusion. First of all, two novel prediction methods was proposed based on support vector machine (SVM): one method is achieved by using the domain-specific features and principle component analysis (PCA) method to eliminate the redundancy (PCA–SVM); the other method is based on Oligo string kernel (Oligo-SVM). Then we proposed a novel machine-learning method for poly(A) motif prediction by marrying four poly(A) motifs recognition methods, including two state-of-the-art methods (Random Forest (RF) and HMM-SVM), and two novel proposed methods (PCA–SVM and Oligo-SVM). A decision level information fusion method was employed to combine the decision values of different classifiers by applying the DS evidence theory. We evaluated our method on a comprehensive poly(A) dataset that consists of 14, 740 samples on 12 variants of poly(A) motifs and 2750 samples containing none of these motifs. Our method has achieved accuracy up to 86.13%. Compared with the four classifiers, our evidence theory based method reduces the average error rate by about 30%, 27%, 26% and 16%, respectively. The experimental results suggest that the proposed method is more effective for poly(A) motif recognition. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 54(2015)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 54(2015)
- Issue Display:
- Volume 54, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 54
- Issue:
- 2015
- Issue Sort Value:
- 2015-0054-2015-0000
- Page Start:
- 49
- Page End:
- 56
- Publication Date:
- 2015-02
- Subjects:
- Polyadenylation motifs -- Support vector machine -- Oligo string kernel -- Increment of diversity -- Evidence theory
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2014.12.001 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 5340.xml