OTUCD: Unsupervised GCN based metagenomics non-overlapping community detection. (June 2022)
- Record Type:
- Journal Article
- Title:
- OTUCD: Unsupervised GCN based metagenomics non-overlapping community detection. (June 2022)
- Main Title:
- OTUCD: Unsupervised GCN based metagenomics non-overlapping community detection
- Authors:
- Zhang, Zhongqing
Jiao, Qiqi
Zhang, Yang
Liu, Bo
Wang, Yadong
Li, Junyi - Abstract:
- Abstract: Metagenomics is a discipline that studies the genetic material of all tiny organisms in the biological environment. In recent years, the interaction between metagenomic microbial communities, the transfer of horizontal genes, and the dynamic changes of microbial ecosystems have attracted more and more attention. It is of great significance to use the community detection algorithm to divide the metagenomic microbes into modules, and it has a positive guiding role for the follow-up research on human, drug, microbial interaction study and drug prediction and development. At present, there are challenges in mining the effective information hidden in large-scale microbial sequence data. The non-linear characteristics and non-scalability of microbial sequence data still bother people. This paper proposes an end-to-end unsupervised GCN learning model OTUCD (Operational Classification Unit Community Detection), which divides large-scale metagenomic sequence data into potential gene modules. We construct an OTU network, and then performs subsequent nonoverlapping community detection task with graph convolutional networks. Experimental scores show that the community detection effect of this method is better than other latest metagenomic algorithms. Graphical Abstract: ga1 Highlights: OTUCD innovatively uses GCN to discover metagenomic similarity network information. Microbiota module division is effectively carried out by OTUCD. Experimental results show that OTUCD hasAbstract: Metagenomics is a discipline that studies the genetic material of all tiny organisms in the biological environment. In recent years, the interaction between metagenomic microbial communities, the transfer of horizontal genes, and the dynamic changes of microbial ecosystems have attracted more and more attention. It is of great significance to use the community detection algorithm to divide the metagenomic microbes into modules, and it has a positive guiding role for the follow-up research on human, drug, microbial interaction study and drug prediction and development. At present, there are challenges in mining the effective information hidden in large-scale microbial sequence data. The non-linear characteristics and non-scalability of microbial sequence data still bother people. This paper proposes an end-to-end unsupervised GCN learning model OTUCD (Operational Classification Unit Community Detection), which divides large-scale metagenomic sequence data into potential gene modules. We construct an OTU network, and then performs subsequent nonoverlapping community detection task with graph convolutional networks. Experimental scores show that the community detection effect of this method is better than other latest metagenomic algorithms. Graphical Abstract: ga1 Highlights: OTUCD innovatively uses GCN to discover metagenomic similarity network information. Microbiota module division is effectively carried out by OTUCD. Experimental results show that OTUCD has achieved the most advanced performance. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 98(2022)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 98(2022)
- Issue Display:
- Volume 98, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 98
- Issue:
- 2022
- Issue Sort Value:
- 2022-0098-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06
- Subjects:
- GNN Graph neural network -- GCN Graph convolutional network -- GAE Graph Autoencoder
Metagenomics -- Non-overlapping community detection -- Large-scale microbial sequence data
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2022.107670 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21569.xml