Covariances Simultaneous Component Analysis: a new method within a framework for modeling covariances. (25th March 2015)
- Record Type:
- Journal Article
- Title:
- Covariances Simultaneous Component Analysis: a new method within a framework for modeling covariances. (25th March 2015)
- Main Title:
- Covariances Simultaneous Component Analysis: a new method within a framework for modeling covariances
- Authors:
- Smilde, Age K.
Timmerman, Marieke E.
Saccenti, Edoardo
Jansen, Jeroen J.
Hoefsloot, Huub C. J. - Abstract:
- Abstract : In modern omics research, it is more rule than exception that multiple data sets are collected in a study pertaining to the same biological organism. In such cases, it is worthwhile to analyze all data tables simultaneously to arrive at global information of the biological system. This is the area of data fusion or multi‐set analysis, which is a lively research topic in chemometrics, bioinformatics, and biostatistics. Most methods of analyzing such complex data focus on group means, treatment effects, or time courses. There is also information present in the covariances among variables within a group, because this relates directly to individual differences, heterogeneity of responses, and changes of regulation in the biological system. We present a framework for analyzing covariance matrices and a new method that fits nicely in this framework. This new method is based on combining covariance prototypes using simultaneous components and is, therefore, coined Covariances Simultaneous Component Analysis (COVSCA). We present the framework and our new method in mathematical terms, thereby explaining the (dis)similarities of the methods. Systems biology models based on differential equations illustrate the type of variation generated in real‐life biological systems and how this type of variation can be modeled within the framework and with COVSCA. The method is subsequently applied to two real‐life data sets from human and plant metabolomics studies showing biologicallyAbstract : In modern omics research, it is more rule than exception that multiple data sets are collected in a study pertaining to the same biological organism. In such cases, it is worthwhile to analyze all data tables simultaneously to arrive at global information of the biological system. This is the area of data fusion or multi‐set analysis, which is a lively research topic in chemometrics, bioinformatics, and biostatistics. Most methods of analyzing such complex data focus on group means, treatment effects, or time courses. There is also information present in the covariances among variables within a group, because this relates directly to individual differences, heterogeneity of responses, and changes of regulation in the biological system. We present a framework for analyzing covariance matrices and a new method that fits nicely in this framework. This new method is based on combining covariance prototypes using simultaneous components and is, therefore, coined Covariances Simultaneous Component Analysis (COVSCA). We present the framework and our new method in mathematical terms, thereby explaining the (dis)similarities of the methods. Systems biology models based on differential equations illustrate the type of variation generated in real‐life biological systems and how this type of variation can be modeled within the framework and with COVSCA. The method is subsequently applied to two real‐life data sets from human and plant metabolomics studies showing biologically meaningful results. Copyright © 2015 John Wiley & Sons, Ltd. Abstract : In modern omics research, multiple data sets are often collected in a study pertaining to the same biological organism. In such cases, it is worthwhile to analyze all data sets simultaneously. There is information present in the covariances among variables within a data set. We present a framework for analyzing covariance matrices and a new method (Covariances Simultaneous Component Analysis) that fits nicely in this framework. The method is applied to two real‐life metabolomics data sets showing biologically meaningful results. … (more)
- Is Part Of:
- Journal of chemometrics. Volume 29:Number 5(2015:May)
- Journal:
- Journal of chemometrics
- Issue:
- Volume 29:Number 5(2015:May)
- Issue Display:
- Volume 29, Issue 5 (2015)
- Year:
- 2015
- Volume:
- 29
- Issue:
- 5
- Issue Sort Value:
- 2015-0029-0005-0000
- Page Start:
- 277
- Page End:
- 288
- Publication Date:
- 2015-03-25
- Subjects:
- indirect fitting -- derived data -- INDSCAL -- IDIOSCAL -- multiblock data -- metabolomics
Chemistry -- Mathematics -- Periodicals
Chemistry -- Statistical methods -- Periodicals
542.85 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cem.2707 ↗
- Languages:
- English
- ISSNs:
- 0886-9383
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4957.380000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5350.xml