Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis. (25th September 2012)
- Record Type:
- Journal Article
- Title:
- Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis. (25th September 2012)
- Main Title:
- Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis
- Authors:
- Devignes, Marie-Dominique
Benabderrahmane, Sidahmed
Smaïl-Tabbone, Malika
Napoli, Amedeo
Poch, Olivier
- Abstract:
- Functional classification aims at grouping genes according to their molecular function or the biological process they participate in. Evaluating the validity of such unsupervised gene classification remains a challenge given the variety of distance measures and classification algorithms that can be used. We evaluate here functional classification of genes with the help of reference sets: KEGG (Kyoto Encyclopaedia of Genes and Genomes) pathways and Pfam clans. These sets represent ground truth for any distance based on GO (Gene Ontology) biological process and molecular function annotations respectively. Overlaps between clusters and reference sets are estimated by the F-score method. We test our previously described IntelliGO semantic distance with hierarchical and fuzzy C-means clustering and we compare results with the state-of-the-art DAVID (Database for Annotation Visualisation and Integrated Discovery) functional classification method. Finally, study of best matching clusters to reference sets leads us to propose a set-difference method for discovering missing information.
- Is Part Of:
- International journal of computational biology and drug design. Volume 5:Number 3/4(2012)
- Journal:
- International journal of computational biology and drug design
- Issue:
- Volume 5:Number 3/4(2012)
- Issue Display:
- Volume 5, Issue 3/4 (2012)
- Year:
- 2012
- Volume:
- 5
- Issue:
- 3/4
- Issue Sort Value:
- 2012-0005-NaN-0000
- Page Start:
- 245
- Page End:
- 260
- Publication Date:
- 2012-09-25
- Subjects:
- semantic similarity measure -- gene ontology -- gene functional classification -- hierarchical clustering -- fuzzy clustering -- overlap analysis
Computational biology -- Periodicals
Drugs -- Design -- Periodicals
570.285
- Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijcbdd
http://www.inderscience.com/
- Languages:
- English
- ISSNs:
- 1756-0756
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 11549.xml
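
The overlap analysis named in the abstract can be illustrated with a minimal sketch. This record does not give the article's exact formulas, so the code assumes the standard F-score (harmonic mean of precision and recall) between a gene cluster and a reference set, and reads the "set-difference method" as a plain set difference between a reference set and its best-matching cluster. The function names and gene identifiers are hypothetical and are not taken from the article.

```python
def f_score(cluster, reference):
    """Standard F-score between a cluster and a reference set (assumed definition)."""
    cluster, reference = set(cluster), set(reference)
    overlap = len(cluster & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(cluster)    # share of the cluster found in the reference
    recall = overlap / len(reference)     # share of the reference recovered by the cluster
    return 2 * precision * recall / (precision + recall)


def best_match(clusters, reference):
    """Name of the cluster with the highest F-score against the reference set."""
    return max(clusters, key=lambda name: f_score(clusters[name], reference))


def set_differences(cluster, reference):
    """Genes present on only one side: candidates for missing information."""
    cluster, reference = set(cluster), set(reference)
    return {
        "in_cluster_only": cluster - reference,
        "in_reference_only": reference - cluster,
    }


# Hypothetical gene identifiers, for illustration only.
clusters = {
    "c1": {"g1", "g2", "g3", "g4"},
    "c2": {"g5", "g6"},
}
reference = {"g1", "g2", "g3", "g7"}

best = best_match(clusters, reference)
diff = set_differences(clusters[best], reference)
```

In this toy case the best match is `c1`, and the differences point at `g4` (clustered with the reference genes but absent from the reference set) and `g7` (in the reference set but not clustered with it), which is the kind of discrepancy the abstract suggests inspecting.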