Investigating microbial co-occurrence patterns based on metagenomic compositional data. (16th June 2015)
- Record Type:
- Journal Article
- Title:
- Investigating microbial co-occurrence patterns based on metagenomic compositional data. (16th June 2015)
- Main Title:
- Investigating microbial co-occurrence patterns based on metagenomic compositional data
- Authors:
- Ban, Yuguang
An, Lingling
Jiang, Hongmei - Abstract:
- Abstract : Motivation: The high-throughput sequencing technologies have provided a powerful tool to study the microbial organisms living in various environments. Characterizing microbial interactions can give us insights into how they live and work together as a community. Metagonomic data are usually summarized in a compositional fashion due to varying sampling/sequencing depths from one sample to another. We study the co-occurrence patterns of microbial organisms using their relative abundance information. Analyzing compositional data using conventional correlation methods has been shown prone to bias that leads to artifactual correlations. Results: We propose a novel method, regularized estimation of the basis covariance based on compositional data (REBACCA), to identify significant co-occurrence patterns by finding sparse solutions to a system with a deficient rank. To be specific, we construct the system using log ratios of count or proportion data and solve the system using the l 1 -norm shrinkage method. Our comprehensive simulation studies show that REBACCA (i) achieves higher accuracy in general than the existing methods when a sparse condition is satisfied; (ii) controls the false positives at a pre-specified level, while other methods fail in various cases and (iii) runs considerably faster than the existing comparable method. REBACCA is also applied to several real metagenomic datasets. Availability and implementation: The R codes for the proposed method areAbstract : Motivation: The high-throughput sequencing technologies have provided a powerful tool to study the microbial organisms living in various environments. Characterizing microbial interactions can give us insights into how they live and work together as a community. Metagonomic data are usually summarized in a compositional fashion due to varying sampling/sequencing depths from one sample to another. We study the co-occurrence patterns of microbial organisms using their relative abundance information. Analyzing compositional data using conventional correlation methods has been shown prone to bias that leads to artifactual correlations. Results: We propose a novel method, regularized estimation of the basis covariance based on compositional data (REBACCA), to identify significant co-occurrence patterns by finding sparse solutions to a system with a deficient rank. To be specific, we construct the system using log ratios of count or proportion data and solve the system using the l 1 -norm shrinkage method. Our comprehensive simulation studies show that REBACCA (i) achieves higher accuracy in general than the existing methods when a sparse condition is satisfied; (ii) controls the false positives at a pre-specified level, while other methods fail in various cases and (iii) runs considerably faster than the existing comparable method. REBACCA is also applied to several real metagenomic datasets. Availability and implementation: The R codes for the proposed method are available at http://faculty.wcas.northwestern.edu/∼hji403/REBACCA.htm Contact: hongmei@northwestern.edu Supplementary information: Supplementary data are available at Bioinformatics online. … (more)
- Is Part Of:
- Bioinformatics. Volume 31:Number 20(2015)
- Journal:
- Bioinformatics
- Issue:
- Volume 31:Number 20(2015)
- Issue Display:
- Volume 31, Issue 20 (2015)
- Year:
- 2015
- Volume:
- 31
- Issue:
- 20
- Issue Sort Value:
- 2015-0031-0020-0000
- Page Start:
- 3322
- Page End:
- 3329
- Publication Date:
- 2015-06-16
- Subjects:
- Bioinformatics -- Periodicals
Genomics -- Data processing -- Periodicals
Computational biology -- Periodicals
572.80285 - Journal URLs:
- http://bioinformatics.oxfordjournals.org ↗
http://firstsearch.oclc.org ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/bioinformatics/btv364 ↗
- Languages:
- English
- ISSNs:
- 1367-4803
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2072.348000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12388.xml