Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition. Issue 4 (December 2016)
- Record Type:
- Journal Article
- Title:
- Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition. Issue 4 (December 2016)
- Main Title:
- Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition
- Authors:
- Huang, Yu-An
You, Zhu-Hong
Chen, Xing
Yan, Gui-Ying - Abstract:
- Abstract Background Protein-protein interactions (PPIs) are essential to most biological processes. Since bioscience has entered into the era of genome and proteome, there is a growing demand for the knowledge about PPI network. High-throughput biological technologies can be used to identify new PPIs, but they are expensive, time-consuming, and tedious. Therefore, computational methods for predicting PPIs have an important role. For the past years, an increasing number of computational methods such as protein structure-based approaches have been proposed for predicting PPIs. The major limitation in principle of these methods lies in the prior information of the protein to infer PPIs. Therefore, it is of much significance to develop computational methods which only use the information of protein amino acids sequence. Results Here, we report a highly efficient approach for predicting PPIs. The main improvements come from the use of a novel protein sequence representation by combining continuous wavelet descriptor and Chou's pseudo amino acid composition (PseAAC), and from adopting weighted sparse representation based classifier (WSRC). This method, cross-validated on the PPIs datasets ofSaccharomyces cerevisiae, Human andH. pylori, achieves an excellent results with accuracies as high as 92.50%, 95.54% and 84.28% respectively, significantly better than previously proposed methods. Extensive experiments are performed to compare the proposed method with state-of-the-art SupportAbstract Background Protein-protein interactions (PPIs) are essential to most biological processes. Since bioscience has entered into the era of genome and proteome, there is a growing demand for the knowledge about PPI network. High-throughput biological technologies can be used to identify new PPIs, but they are expensive, time-consuming, and tedious. Therefore, computational methods for predicting PPIs have an important role. For the past years, an increasing number of computational methods such as protein structure-based approaches have been proposed for predicting PPIs. The major limitation in principle of these methods lies in the prior information of the protein to infer PPIs. Therefore, it is of much significance to develop computational methods which only use the information of protein amino acids sequence. Results Here, we report a highly efficient approach for predicting PPIs. The main improvements come from the use of a novel protein sequence representation by combining continuous wavelet descriptor and Chou's pseudo amino acid composition (PseAAC), and from adopting weighted sparse representation based classifier (WSRC). This method, cross-validated on the PPIs datasets ofSaccharomyces cerevisiae, Human andH. pylori, achieves an excellent results with accuracies as high as 92.50%, 95.54% and 84.28% respectively, significantly better than previously proposed methods. Extensive experiments are performed to compare the proposed method with state-of-the-art Support Vector Machine (SVM) classifier. Conclusions The outstanding results yield by our model that the proposed feature extraction method combing two kinds of descriptors have strong expression ability and are expected to provide comprehensive and effective information for machine learning-based classification models. In addition, the prediction performance in the comparison experiments shows the well cooperation between the combined feature and WSRC. Thus, the proposed method is a very efficient method to predict PPIs and may be a useful supplementary tool for future proteomics studies. … (more)
- Is Part Of:
- BMC systems biology. Volume 10:Issue 4(2016)
- Journal:
- BMC systems biology
- Issue:
- Volume 10:Issue 4(2016)
- Issue Display:
- Volume 10, Issue 4 (2016)
- Year:
- 2016
- Volume:
- 10
- Issue:
- 4
- Issue Sort Value:
- 2016-0010-0004-0000
- Page Start:
- 485
- Page End:
- 494
- Publication Date:
- 2016-12
- Subjects:
- Protein-protein interactions -- Protein sequence -- Continuous wavelet transform -- Sparse representation based classifier
Biological systems -- Periodicals
Biology -- Research -- Periodicals
Cell physiology -- Periodicals
Genes -- Analysis -- Periodicals
571 - Journal URLs:
- http://www.biomedcentral.com/bmcsystbiol/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12918-016-0360-6 ↗
- Languages:
- English
- ISSNs:
- 1752-0509
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 10954.xml