Building and analysis of protein-protein interactions related to diabetes mellitus using support vector machine, biomedical text mining and network analysis. (December 2016)
- Record Type:
- Journal Article
- Title:
- Building and analysis of protein-protein interactions related to diabetes mellitus using support vector machine, biomedical text mining and network analysis. (December 2016)
- Main Title:
- Building and analysis of protein-protein interactions related to diabetes mellitus using support vector machine, biomedical text mining and network analysis
- Authors:
- Vyas, Renu
Bapat, Sanket
Jain, Esha
Karthikeyan, Muthukumarasamy
Tambe, Sanjeev
Kulkarni, Bhaskar D. - Abstract:
- Graphical abstract: Highlights: New protein fingerprints for capturing the topological properties of protein complexes in a linear format. A SVM based predictive model for discriminating diabetes versus non-diabetes complexes with an AUC of 0.78. Model tested on an external data set derived from text mining large number of PubMed abstracts. Network modeling to identify new disease targets. Abstract: In order to understand the molecular mechanism underlying any disease, knowledge about the interacting proteins in the disease pathway is essential. The number of revealed protein-protein interactions (PPI) is still very limited compared to the available protein sequences of different organisms. Experiment based high-throughput technologies though provide some data about these interactions, those are often fairly noisy. Computational techniques for predicting protein–protein interactions therefore assume significance. 1296 binary fingerprints that encode a combination of structural and geometric properties were developed using the crystallographic data of 15, 000 protein complexes in the pdb server. In a case study, these fingerprints were created for proteins implicated in the Type 2 diabetes mellitus disease. The fingerprints were input into a SVM based model for discriminating disease proteins from non disease proteins yielding a classification accuracy of 78.2% (AUC value of 0.78) on an external data set composed of proteins retrieved via text mining of diabetes relatedGraphical abstract: Highlights: New protein fingerprints for capturing the topological properties of protein complexes in a linear format. A SVM based predictive model for discriminating diabetes versus non-diabetes complexes with an AUC of 0.78. Model tested on an external data set derived from text mining large number of PubMed abstracts. Network modeling to identify new disease targets. Abstract: In order to understand the molecular mechanism underlying any disease, knowledge about the interacting proteins in the disease pathway is essential. The number of revealed protein-protein interactions (PPI) is still very limited compared to the available protein sequences of different organisms. Experiment based high-throughput technologies though provide some data about these interactions, those are often fairly noisy. Computational techniques for predicting protein–protein interactions therefore assume significance. 1296 binary fingerprints that encode a combination of structural and geometric properties were developed using the crystallographic data of 15, 000 protein complexes in the pdb server. In a case study, these fingerprints were created for proteins implicated in the Type 2 diabetes mellitus disease. The fingerprints were input into a SVM based model for discriminating disease proteins from non disease proteins yielding a classification accuracy of 78.2% (AUC value of 0.78) on an external data set composed of proteins retrieved via text mining of diabetes related literature. A PPI network was constructed and analysed to explore new disease targets. The integrated approach exemplified here has a potential for identifying disease related proteins, functional annotation and other proteomics studies. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 65(2016)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 65(2016)
- Issue Display:
- Volume 65, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 65
- Issue:
- 2016
- Issue Sort Value:
- 2016-0065-2016-0000
- Page Start:
- 37
- Page End:
- 44
- Publication Date:
- 2016-12
- Subjects:
- Diabetes -- SVM -- Protein-protein interactions -- Machine learning -- Protein interaction network
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2016.09.011 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 7632.xml