Data-driven characterization of molecular phenotypes across heterogeneous sample collections. Issue 13 (24th April 2019)
- Record Type:
- Journal Article
- Title:
- Data-driven characterization of molecular phenotypes across heterogeneous sample collections. Issue 13 (24th April 2019)
- Main Title:
- Data-driven characterization of molecular phenotypes across heterogeneous sample collections
- Authors:
- Mehtonen, Juha
Pölönen, Petri
Häyrynen, Sergei
Dufva, Olli
Lin, Jake
Liuksiala, Thomas
Granberg, Kirsi
Lohi, Olli
Hautamäki, Ville
Nykter, Matti
Heinäniemi, Merja - Abstract:
- Abstract: Existing large gene expression data repositories hold enormous potential to elucidate disease mechanisms, characterize changes in cellular pathways, and to stratify patients based on molecular profiles. To achieve this goal, integrative resources and tools are needed that allow comparison of results across datasets and data types. We propose an intuitive approach for data-driven stratifications of molecular profiles and benchmark our methodology using the dimensionality reduction algorithm t-distributed stochastic neighbor embedding (t-SNE) with multi-study and multi-platform data on hematological malignancies. Our approach enables assessing the contribution of biological versus technical variation to sample clustering, direct incorporation of additional datasets to the same low dimensional representation, comparison of molecular disease subtypes identified from separate t-SNE representations, and characterization of the obtained clusters based on pathway databases and additional data. In this manner, we performed an integrative analysis across multi-omics acute myeloid leukemia studies. Our approach indicated new molecular subtypes with differential survival and drug responsiveness among samples lacking fusion genes, including a novel myelodysplastic syndrome-like cluster and a cluster characterized with CEBPA mutations and differential activity of the S-adenosylmethionine-dependent DNA methylation pathway. In summary, integration across multiple studies can helpAbstract: Existing large gene expression data repositories hold enormous potential to elucidate disease mechanisms, characterize changes in cellular pathways, and to stratify patients based on molecular profiles. To achieve this goal, integrative resources and tools are needed that allow comparison of results across datasets and data types. We propose an intuitive approach for data-driven stratifications of molecular profiles and benchmark our methodology using the dimensionality reduction algorithm t-distributed stochastic neighbor embedding (t-SNE) with multi-study and multi-platform data on hematological malignancies. Our approach enables assessing the contribution of biological versus technical variation to sample clustering, direct incorporation of additional datasets to the same low dimensional representation, comparison of molecular disease subtypes identified from separate t-SNE representations, and characterization of the obtained clusters based on pathway databases and additional data. In this manner, we performed an integrative analysis across multi-omics acute myeloid leukemia studies. Our approach indicated new molecular subtypes with differential survival and drug responsiveness among samples lacking fusion genes, including a novel myelodysplastic syndrome-like cluster and a cluster characterized with CEBPA mutations and differential activity of the S-adenosylmethionine-dependent DNA methylation pathway. In summary, integration across multiple studies can help to identify novel molecular disease subtypes and generate insight into disease biology. … (more)
- Is Part Of:
- Nucleic acids research. Volume 47:Issue 13(2019)
- Journal:
- Nucleic acids research
- Issue:
- Volume 47:Issue 13(2019)
- Issue Display:
- Volume 47, Issue 13 (2019)
- Year:
- 2019
- Volume:
- 47
- Issue:
- 13
- Issue Sort Value:
- 2019-0047-0013-0000
- Page Start:
- e76
- Page End:
- e76
- Publication Date:
- 2019-04-24
- Subjects:
- Nucleic acids -- Periodicals
Molecular biology -- Periodicals
572.805 - Journal URLs:
- http://nar.oxfordjournals.org/ ↗
http://www.ncbi.nlm.nih.gov/pmc/journals/4 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/nar/gkz281 ↗
- Languages:
- English
- ISSNs:
- 0305-1048
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6183.850000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11991.xml