Interspecies gene function prediction using semantic similarity. Issue 4 (December 2016)
- Record Type:
- Journal Article
- Title:
- Interspecies gene function prediction using semantic similarity. Issue 4 (December 2016)
- Main Title:
- Interspecies gene function prediction using semantic similarity
- Authors:
- Yu, Guoxian
Luo, Wei
Fu, Guangyuan
Wang, Jun - Abstract:
- Abstract Background Gene Ontology (GO) is a collaborative project that maintains and develops controlled vocabulary (or terms) to describe the molecular function, biological roles and cellular location of gene products in a hierarchical ontology. GO also provides GO annotations that associate genes with GO terms. GO consortium independently and collaboratively annotate terms to gene products, mainly from model organisms (or species) they are interested in. Due to experiment ethics, research interests of biologists and resources limitations, homologous genes from different species currently are annotated with different terms. These differences can be more attributed to incomplete annotations of genes than to functional difference between them. Results Semantic similarity between genes is derived from GO hierarchy and annotations of genes. It is positively correlated with the similarity derived from various types of biological data and has been applied to predict gene function. In this paper, we investigate whether it is possible to replenish annotations of incompletely annotated genes by using semantic similarity between genes from two species with homology. For this investigation, we utilize three representative semantic similarity metrics to compute similarity between genes from two species. Next, we determine thek nearest neighborhood genes from the two species based on the chosen metric and then use terms annotated tok neighbors of a gene to replenish annotations of thatAbstract Background Gene Ontology (GO) is a collaborative project that maintains and develops controlled vocabulary (or terms) to describe the molecular function, biological roles and cellular location of gene products in a hierarchical ontology. GO also provides GO annotations that associate genes with GO terms. GO consortium independently and collaboratively annotate terms to gene products, mainly from model organisms (or species) they are interested in. Due to experiment ethics, research interests of biologists and resources limitations, homologous genes from different species currently are annotated with different terms. These differences can be more attributed to incomplete annotations of genes than to functional difference between them. Results Semantic similarity between genes is derived from GO hierarchy and annotations of genes. It is positively correlated with the similarity derived from various types of biological data and has been applied to predict gene function. In this paper, we investigate whether it is possible to replenish annotations of incompletely annotated genes by using semantic similarity between genes from two species with homology. For this investigation, we utilize three representative semantic similarity metrics to compute similarity between genes from two species. Next, we determine thek nearest neighborhood genes from the two species based on the chosen metric and then use terms annotated tok neighbors of a gene to replenish annotations of that gene. We perform experiments on archived (from Jan-2014 to Jan-2016) GO annotations of four species (Human, Mouse, Danio rerio and Arabidopsis thaliana) to assess the contribution of semantic similarity between genes from different species. The experimental results demonstrate that: (1) semantic similarity between genes from homologous species contributes much more on the improved accuracy (by 53.22%) than genes from single species alone, and genes from two species with low homology; (2) GO annotations of genes from homologous species are complementary to each other. Conclusions Our study shows that semantic similarity based interspecies gene function annotation from homologous species is more prominent than traditional intraspecies approaches. This work can promote more research on semantic similarity based function prediction across species. … (more)
- Is Part Of:
- BMC systems biology. Volume 10:Issue 4(2016)
- Journal:
- BMC systems biology
- Issue:
- Volume 10:Issue 4(2016)
- Issue Display:
- Volume 10, Issue 4 (2016)
- Year:
- 2016
- Volume:
- 10
- Issue:
- 4
- Issue Sort Value:
- 2016-0010-0004-0000
- Page Start:
- 495
- Page End:
- 507
- Publication Date:
- 2016-12
- Subjects:
- GO annotations -- Semantic similarity -- Gene function prediction -- Interspecies
Biological systems -- Periodicals
Biology -- Research -- Periodicals
Cell physiology -- Periodicals
Genes -- Analysis -- Periodicals
571 - Journal URLs:
- http://www.biomedcentral.com/bmcsystbiol/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12918-016-0361-5 ↗
- Languages:
- English
- ISSNs:
- 1752-0509
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 10954.xml