Computational method for discovery of biomarker signatures from large, complex data sets. (October 2018)
- Record Type:
- Journal Article
- Title:
- Computational method for discovery of biomarker signatures from large, complex data sets. (October 2018)
- Main Title:
- Computational method for discovery of biomarker signatures from large, complex data sets
- Authors:
- Makarov, Vladimir
Gorlin, Alex - Abstract:
- Graphical abstract: Highlights: Mathematical methodology for mining for reliable biomarker panels from large multivariate data sets. Validated on the Drug Matrix toxicogenomics data set, but applicable broadly. Able to recognize 176 out of 316 compounds in Drug Matrix at 90% or greater accuracy. Abstract: We present an efficient method for identifying of reliable biomarker panels from large multivariate data sets that typically result from experiments that monitor changes in RNA, small molecule, or protein abundance. Our computational methodology is developed and validated on the toxicogenomics database Drug Matrix that in its largest category contains 1656 recognition targets, characterized by the toxicant, dose and time (or duration) of the exposure. We were able to recognize both individual experimental conditions (compound, dose and time combinations) and the cases where the values for dose and time variables fall within the intervals in the training data, but do not match the training data exactly. Inclusion of gene expression information for multiple organs improved accuracy of recognition. Inclusion of time response information into consideration allowed us to develop particularly accurate marker panels for a large number of targets: we were able to recognize 176 compounds (out of 316) at greater than 90% accuracy. The presented methodology has an immediate application for discovery of diagnostic biomarker panels for exposure to various toxicity hazards, and may alsoGraphical abstract: Highlights: Mathematical methodology for mining for reliable biomarker panels from large multivariate data sets. Validated on the Drug Matrix toxicogenomics data set, but applicable broadly. Able to recognize 176 out of 316 compounds in Drug Matrix at 90% or greater accuracy. Abstract: We present an efficient method for identifying of reliable biomarker panels from large multivariate data sets that typically result from experiments that monitor changes in RNA, small molecule, or protein abundance. Our computational methodology is developed and validated on the toxicogenomics database Drug Matrix that in its largest category contains 1656 recognition targets, characterized by the toxicant, dose and time (or duration) of the exposure. We were able to recognize both individual experimental conditions (compound, dose and time combinations) and the cases where the values for dose and time variables fall within the intervals in the training data, but do not match the training data exactly. Inclusion of gene expression information for multiple organs improved accuracy of recognition. Inclusion of time response information into consideration allowed us to develop particularly accurate marker panels for a large number of targets: we were able to recognize 176 compounds (out of 316) at greater than 90% accuracy. The presented methodology has an immediate application for discovery of diagnostic biomarker panels for exposure to various toxicity hazards, and may also be useful for development of biological markers for medical applications. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 76(2018)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 76(2018)
- Issue Display:
- Volume 76, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 76
- Issue:
- 2018
- Issue Sort Value:
- 2018-0076-2018-0000
- Page Start:
- 161
- Page End:
- 168
- Publication Date:
- 2018-10
- Subjects:
- Biomarker -- Microarray -- Gene expression -- Chemical -- Classification
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2018.07.008 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23145.xml