Prediction of an outcome using NETwork Clusters (NET-C). (February 2021)
- Record Type:
- Journal Article
- Title:
- Prediction of an outcome using NETwork Clusters (NET-C). (February 2021)
- Main Title:
- Prediction of an outcome using NETwork Clusters (NET-C)
- Authors:
- Lee, Jai Woo
Zhou, Jie
Moen, Erika L.
Punshon, Tracy
Hoen, Anne G.
Romano, Megan E.
Karagas, Margaret R.
Gui, Jiang - Abstract:
- Graphical abstract: Highlights: NET-C identified a network of features that predict birth weight. It outperforms hierarchical clustering and random forest regression in predicting birth weight in New Hampshire Birth Cohort Study. NET-C identified a subnetwork of metabolites including lysophosphatidylcholines were nonnegatively associated with birth weight Net-C confirmed that higher Cu concentrations was related to reduced birth weight Abstract: Birth weight is a key consequence of environmental exposures and metabolic alterations and can influence lifelong health. While a number of methods have been used to examine associations of trace element (including essential nutrients and toxic metals) concentrations or metabolite concentrations with a health outcome, birth weight, studies evaluating how the coexistence of these factors impacts birth weight are extremely limited. Here, we present a novel algorithm NETwork Clusters (NET-C), to improve the prediction of outcome by considering the interactions of features in the network and then apply this method to predict birth weight by jointly modelling trace element and cord blood metabolite data. Specifically, by using trace element and/or metabolite subnetworks as groups, we apply group lasso to estimate birth weight. We conducted statistical simulation studies to examine how both sample size and correlations between grouped features and the outcome affect prediction performance. We showed that in terms of prediction error, ourGraphical abstract: Highlights: NET-C identified a network of features that predict birth weight. It outperforms hierarchical clustering and random forest regression in predicting birth weight in New Hampshire Birth Cohort Study. NET-C identified a subnetwork of metabolites including lysophosphatidylcholines were nonnegatively associated with birth weight Net-C confirmed that higher Cu concentrations was related to reduced birth weight Abstract: Birth weight is a key consequence of environmental exposures and metabolic alterations and can influence lifelong health. While a number of methods have been used to examine associations of trace element (including essential nutrients and toxic metals) concentrations or metabolite concentrations with a health outcome, birth weight, studies evaluating how the coexistence of these factors impacts birth weight are extremely limited. Here, we present a novel algorithm NETwork Clusters (NET-C), to improve the prediction of outcome by considering the interactions of features in the network and then apply this method to predict birth weight by jointly modelling trace element and cord blood metabolite data. Specifically, by using trace element and/or metabolite subnetworks as groups, we apply group lasso to estimate birth weight. We conducted statistical simulation studies to examine how both sample size and correlations between grouped features and the outcome affect prediction performance. We showed that in terms of prediction error, our proposed method outperformed other methods such as (a) group lasso with groups defined by hierarchical clustering, (b) random forest regression and (c) neural networks. We applied our method to data ascertained as part of the New Hampshire Birth Cohort Study on trace elements, metabolites and birth outcomes, adjusting for other covariates such as maternal body mass index (BMI) and enrollment age. Our proposed method can be applied to a variety of similarly structured high-dimensional datasets to predict health outcomes. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 90(2021)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 90(2021)
- Issue Display:
- Volume 90, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 90
- Issue:
- 2021
- Issue Sort Value:
- 2021-0090-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Outcome prediction -- Gaussian graphical model -- Lasso -- Dimensionality reduction -- Trace element exposures -- Metabolic network
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2020.107425 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15949.xml