A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data. (24th May 2017)
- Record Type:
- Journal Article
- Title:
- A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data. (24th May 2017)
- Main Title:
- A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data
- Authors:
- Mo, Qianxing
Shen, Ronglai
Guo, Cui
Vannucci, Marina
Chan, Keith S
Hilsenbeck, Susan G - Abstract:
- SUMMARY: Identification of clinically relevant tumor subtypes and omics signatures is an important task in cancer translational research for precision medicine. Large-scale genomic profiling studies such as The Cancer Genome Atlas (TCGA) Research Network have generated vast amounts of genomic, transcriptomic, epigenomic, and proteomic data. While these studies have provided great resources for researchers to discover clinically relevant tumor subtypes and driver molecular alterations, there are few computationally efficient methods and tools for integrative clustering analysis of these multi-type omics data. Therefore, the aim of this article is to develop a fully Bayesian latent variable method (called iClusterBayes) that can jointly model omics data of continuous and discrete data types for identification of tumor subtypes and relevant omics features. Specifically, the proposed method uses a few latent variables to capture the inherent structure of multiple omics data sets to achieve joint dimension reduction. As a result, the tumor samples can be clustered in the latent variable space and relevant omics features that drive the sample clustering are identified through Bayesian variable selection. This method significantly improve on the existing integrative clustering method iClusterPlus in terms of statistical inference and computational speed. By analyzing TCGA and simulated data sets, we demonstrate the excellent performance of the proposed method in revealingSUMMARY: Identification of clinically relevant tumor subtypes and omics signatures is an important task in cancer translational research for precision medicine. Large-scale genomic profiling studies such as The Cancer Genome Atlas (TCGA) Research Network have generated vast amounts of genomic, transcriptomic, epigenomic, and proteomic data. While these studies have provided great resources for researchers to discover clinically relevant tumor subtypes and driver molecular alterations, there are few computationally efficient methods and tools for integrative clustering analysis of these multi-type omics data. Therefore, the aim of this article is to develop a fully Bayesian latent variable method (called iClusterBayes) that can jointly model omics data of continuous and discrete data types for identification of tumor subtypes and relevant omics features. Specifically, the proposed method uses a few latent variables to capture the inherent structure of multiple omics data sets to achieve joint dimension reduction. As a result, the tumor samples can be clustered in the latent variable space and relevant omics features that drive the sample clustering are identified through Bayesian variable selection. This method significantly improve on the existing integrative clustering method iClusterPlus in terms of statistical inference and computational speed. By analyzing TCGA and simulated data sets, we demonstrate the excellent performance of the proposed method in revealing clinically meaningful tumor subtypes and driver omics features. … (more)
- Is Part Of:
- Biostatistics. Volume 19:Number 1(2018)
- Journal:
- Biostatistics
- Issue:
- Volume 19:Number 1(2018)
- Issue Display:
- Volume 19, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 19
- Issue:
- 1
- Issue Sort Value:
- 2018-0019-0001-0000
- Page Start:
- 71
- Page End:
- 86
- Publication Date:
- 2017-05-24
- Subjects:
- Multi-type omics data -- Integrative clustering -- iCluster -- iClusterPlus -- iClusterBayes -- Latent variable model -- Bayesian variable selection
Medical statistics -- Periodicals
Biometry -- Periodicals
Health risk assessment -- Periodicals
Medicine -- Research -- Statistical methods -- Periodicals
610.727 - Journal URLs:
- http://www3.oup.co.uk/biosts ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/biostatistics/kxx017 ↗
- Languages:
- English
- ISSNs:
- 1465-4644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2089.628000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12169.xml