HashGO: hashing gene ontology for protein function prediction. (December 2017)
- Record Type:
- Journal Article
- Title:
- HashGO: hashing gene ontology for protein function prediction. (December 2017)
- Main Title:
- HashGO: hashing gene ontology for protein function prediction
- Authors:
- Yu, Guoxian
Zhao, Yingwen
Lu, Chang
Wang, Jun
- Abstract:
- Gene ontology (GO) is a standardized and controlled vocabulary of terms that describe the molecular functions, biological roles and cellular locations of proteins. GO terms and the GO hierarchy are regularly updated as biological knowledge accumulates. More than 50,000 terms are included in GO, and each protein is annotated with several or dozens of these terms. Therefore, accurately predicting the associations between proteins and this massive set of GO terms is rather challenging. To predict these associations accurately, we proposed a method called Hashing GO for protein function prediction (HashGO for short). HashGO first adopts a protein-term association matrix to store the available GO annotations of proteins. Then, it tailors a graph hashing method to explore the underlying structure between GO terms and to obtain a series of hash functions that compress the high-dimensional protein-term association matrix into a low-dimensional one. Next, HashGO computes the semantic similarity between proteins based on the Hamming distance on that low-dimensional matrix. After that, it predicts missing annotations of a protein based on the annotations of its semantic neighbors. Experimental results on archived GO annotations of two model species (Yeast and Human) show that HashGO not only predicts functions more accurately than other related approaches, but also runs faster than them.
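The four steps summarized in the abstract (association matrix, hashing to short binary codes, Hamming-distance similarity, neighbor-based prediction) can be illustrated with a small sketch. This is not the authors' implementation: sign random projections are used here as a simple stand-in for the tailored graph hashing method, and the function names, the toy matrix, and the parameters `n_bits` and `k` are all assumptions made for illustration.

```python
import numpy as np


def hash_codes(assoc, n_bits, seed=0):
    """Compress an (n_proteins x n_terms) 0/1 matrix into n_bits-bit codes.

    Stand-in only: signs of random projections, NOT the paper's graph hashing.
    """
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((assoc.shape[1], n_bits))
    centered = assoc - assoc.mean(axis=0)  # center each term column
    return (centered @ planes > 0).astype(np.uint8)


def hamming_distances(codes):
    """Pairwise Hamming distances between the rows of a 0/1 code matrix."""
    return (codes[:, None, :] != codes[None, :, :]).sum(axis=2)


def predict_scores(assoc, codes, k):
    """Score every protein-term pair from the k nearest proteins in Hamming space."""
    dist = hamming_distances(codes).astype(float)
    np.fill_diagonal(dist, np.inf)  # a protein is not its own neighbor
    n_bits = codes.shape[1]
    scores = np.zeros(assoc.shape, dtype=float)
    for i in range(assoc.shape[0]):
        nbrs = np.argsort(dist[i], kind="stable")[:k]
        weights = 1.0 - dist[i, nbrs] / n_bits  # similarity in [0, 1]
        total = weights.sum()
        if total > 0:
            # Similarity-weighted vote over the neighbors' annotations
            scores[i] = weights @ assoc[nbrs] / total
    return scores


# Hypothetical toy data: 5 proteins (rows) x 6 GO terms (columns)
assoc = np.array([
    [1, 1, 0, 0, 1, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 1],
    [0, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
])

codes = hash_codes(assoc, n_bits=4)
scores = predict_scores(assoc, codes, k=2)
print(np.round(scores, 2))
```

A high score for a term that a protein does not yet carry would be read as a candidate missing annotation. In this sketch `k` must be smaller than the number of proteins so that a protein is never counted as its own neighbor.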
- Is Part Of:
- Computational biology and chemistry. Volume 71(2017)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 71(2017)
- Issue Display:
- Volume 71, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 71
- Issue:
- 2017
- Issue Sort Value:
- 2017-0071-2017-0000
- Page Start:
- 264
- Page End:
- 273
- Publication Date:
- 2017-12
- Subjects:
- Gene ontology -- Protein function prediction -- Graph hashing -- Semantic similarity
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85
- Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.compbiolchem.2017.09.010
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 5452.xml