Metabolic networks classification and knowledge discovery by information granulation. (February 2020)
- Record Type:
- Journal Article
- Title:
- Metabolic networks classification and knowledge discovery by information granulation. (February 2020)
- Main Title:
- Metabolic networks classification and knowledge discovery by information granulation
- Authors:
- Martino, Alessio
Giuliani, Alessandro
Todde, Virginia
Bizzarri, Mariano
Rizzi, Antonello - Abstract:
- Highlights: Novel information granulation and knowledge discovery technique for structured data. Information granules as relevant substructures (high sensitivity and specificity). Information granules are further filtered at classification stage (suboptimal subset). Tests on real metabolic pathways data classification show remarkable performances. Knowledge discovery over filtered granules paves the way for further studies. Abstract: Graphs are powerful structures able to capture topological and semantic information from data, hence suitable for modelling a plethora of real-world (complex) systems. For this reason, graph-based pattern recognition gained a lot of attention in recent years. In this paper, a general-purpose classification system in the graphs domain is presented. When most of the information of the available patterns can be encoded in edge labels, an information granulation-based approach is highly discriminant and allows for the identification of semantically meaningful edges. The proposed classification system has been tested on the entire set of organisms (5299) for which metabolic networks are known, allowing for both a perfect mirroring of the underlying taxonomy and the identification of most discriminant metabolic reactions and pathways. The widespread diffusion of graph (network) structures in biology makes the proposed pattern recognition approach potentially very useful in many different fields of application. More specifically, the possibility toHighlights: Novel information granulation and knowledge discovery technique for structured data. Information granules as relevant substructures (high sensitivity and specificity). Information granules are further filtered at classification stage (suboptimal subset). Tests on real metabolic pathways data classification show remarkable performances. Knowledge discovery over filtered granules paves the way for further studies. Abstract: Graphs are powerful structures able to capture topological and semantic information from data, hence suitable for modelling a plethora of real-world (complex) systems. For this reason, graph-based pattern recognition gained a lot of attention in recent years. In this paper, a general-purpose classification system in the graphs domain is presented. When most of the information of the available patterns can be encoded in edge labels, an information granulation-based approach is highly discriminant and allows for the identification of semantically meaningful edges. The proposed classification system has been tested on the entire set of organisms (5299) for which metabolic networks are known, allowing for both a perfect mirroring of the underlying taxonomy and the identification of most discriminant metabolic reactions and pathways. The widespread diffusion of graph (network) structures in biology makes the proposed pattern recognition approach potentially very useful in many different fields of application. More specifically, the possibility to have a reliable metric to compare different metabolic systems is instrumental in emerging fields like microbiome analysis and, more in general, for proposing metabolic networks as a universal phenotype spanning the entire tree of life and in direct contact with environmental cues. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 84(2020)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 84(2020)
- Issue Display:
- Volume 84, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 84
- Issue:
- 2020
- Issue Sort Value:
- 2020-0084-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-02
- Subjects:
- 68T10 (Pattern Recognition -- Speech Recognition) -- 62H30 (Classification and Discrimination -- Cluster Analysis) -- 05C82 (Small world graphs -- complex networks) -- 92C42 (Systems biology -- networks)
Granular computing -- Embedding spaces -- Support vector machines -- Computational biology -- Metabolic pathways -- Complex networks
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2019.107187 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 12624.xml