Reprint of "Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction". (December 2015)
- Record Type:
- Journal Article
- Title:
- Reprint of "Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction". (December 2015)
- Main Title:
- Reprint of "Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction"
- Authors:
- Rouillard, Andrew D.
Wang, Zichen
Ma'ayan, Avi - Abstract:
- Graphical abstract: Highlights: A small fraction of biomedical Big Data is converted to useful knowledge or reused. Overview of a collection of structured mostly molecular mammalian biomedical Big Data resources. Biases within data from these resources are suspected. Data abstraction to attribute tables, networks and gene-sets enables reuse of biomedical datasets for integrative analyses. Once data is abstracted it can be integrated and analyzed using supervised, unsupervised and integrative methods. Abstract: With advances in genomics, transcriptomics, metabolomics and proteomics, and more expansive electronic clinical record monitoring, as well as advances in computation, we have entered the Big Data era in biomedical research. Data gathering is growing rapidly while only a small fraction of this data is converted to useful knowledge or reused in future studies. To improve this, an important concept that is often overlooked is data abstraction. To fuse and reuse biomedical datasets from diverse resources, data abstraction is frequently required. Here we summarize some of the major Big Data biomedical research resources for genomics, proteomics and phenotype data, collected from mammalian cells, tissues and organisms. We then suggest simple data abstraction methods for fusing this diverse but related data. Finally, we demonstrate examples of the potential utility of such data integration efforts, while warning about the inherit biases that exist within such data.
- Is Part Of:
- Computational biology and chemistry. Volume 59:Part B(2015)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 59:Part B(2015)
- Issue Display:
- Volume 59, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 59
- Issue:
- 2015
- Issue Sort Value:
- 2015-0059-2015-0000
- Page Start:
- 123
- Page End:
- 138
- Publication Date:
- 2015-12
- Subjects:
- Data integration -- Bioinformatics -- Systems biology -- Systems pharmacology -- Network biology
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2015.08.005 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 349.xml