Using Machine Learning to Identify Metabolomic Signatures of Pediatric Chronic Kidney Disease Etiology. Issue 2 (February 2022)
- Record Type:
- Journal Article
- Title:
- Using Machine Learning to Identify Metabolomic Signatures of Pediatric Chronic Kidney Disease Etiology. Issue 2 (February 2022)
- Main Title:
- Using Machine Learning to Identify Metabolomic Signatures of Pediatric Chronic Kidney Disease Etiology
- Authors:
- Lee, Arthur M.
Hu, Jian
Xu, Yunwen
Abraham, Alison G.
Xiao, Rui
Coresh, Josef
Rebholz, Casey
Chen, Jingsha
Rhee, Eugene P.
Feldman, Harold I.
Ramachandran, Vasan S.
Kimmel, Paul L.
Warady, Bradley A.
Furth, Susan L.
Denburg, Michelle R. - Abstract:
- Significance Statement: Machine learning used with biostatistics identified metabolomic signatures in the plasma of pediatric patients with CKD, providing clues to cause. Dysmetabolism in the sphingomyelin-ceramide axis is associated with both FSGS and the aplasia/dysplasia/hypoplasia spectrum. Pediatric FSGS is associated with elevated plasmalogen levels, in contrast to reports of associations with plasmalogen deficiencies. Our strategy also revealed associations of obstructive uropathy with gut-derived histidines and of reflux nephropathy with indole-tryptophans. Visual Abstract: Abstract : Background: Untargeted plasma metabolomic profiling combined with machine learning (ML) may lead to discovery of metabolic profiles that inform our understanding of pediatric CKD causes. We sought to identify metabolomic signatures in pediatric CKD based on diagnosis: FSGS, obstructive uropathy (OU), aplasia/dysplasia/hypoplasia (A/D/H), and reflux nephropathy (RN). Methods: Untargeted metabolomic quantification (GC-MS/LC-MS, Metabolon) was performed on plasma from 702 Chronic Kidney Disease in Children study participants ( n : FSGS=63, OU=122, A/D/H=109, and RN=86). Lasso regression was used for feature selection, adjusting for clinical covariates. Four methods were then applied to stratify significance: logistic regression, support vector machine, random forest, and extreme gradient boosting. ML training was performed on 80% total cohort subsets and validated on 20% holdout subsets.Significance Statement: Machine learning used with biostatistics identified metabolomic signatures in the plasma of pediatric patients with CKD, providing clues to cause. Dysmetabolism in the sphingomyelin-ceramide axis is associated with both FSGS and the aplasia/dysplasia/hypoplasia spectrum. Pediatric FSGS is associated with elevated plasmalogen levels, in contrast to reports of associations with plasmalogen deficiencies. Our strategy also revealed associations of obstructive uropathy with gut-derived histidines and of reflux nephropathy with indole-tryptophans. Visual Abstract: Abstract : Background: Untargeted plasma metabolomic profiling combined with machine learning (ML) may lead to discovery of metabolic profiles that inform our understanding of pediatric CKD causes. We sought to identify metabolomic signatures in pediatric CKD based on diagnosis: FSGS, obstructive uropathy (OU), aplasia/dysplasia/hypoplasia (A/D/H), and reflux nephropathy (RN). Methods: Untargeted metabolomic quantification (GC-MS/LC-MS, Metabolon) was performed on plasma from 702 Chronic Kidney Disease in Children study participants ( n : FSGS=63, OU=122, A/D/H=109, and RN=86). Lasso regression was used for feature selection, adjusting for clinical covariates. Four methods were then applied to stratify significance: logistic regression, support vector machine, random forest, and extreme gradient boosting. ML training was performed on 80% total cohort subsets and validated on 20% holdout subsets. Important features were selected based on being significant in at least two of the four modeling approaches. We additionally performed pathway enrichment analysis to identify metabolic subpathways associated with CKD cause. Results: ML models were evaluated on holdout subsets with receiver-operator and precision-recall area-under-the-curve, F1 score, and Matthews correlation coefficient. ML models outperformed no-skill prediction. Metabolomic profiles were identified based on cause. FSGS was associated with the sphingomyelin-ceramide axis. FSGS was also associated with individual plasmalogen metabolites and the subpathway. OU was associated with gut microbiome–derived histidine metabolites. Conclusion: ML models identified metabolomic signatures based on CKD cause. Using ML techniques in conjunction with traditional biostatistics, we demonstrated that sphingomyelin-ceramide and plasmalogen dysmetabolism are associated with FSGS and that gut microbiome–derived histidine metabolites are associated with OU. … (more)
- Is Part Of:
- Journal of the American Society of Nephrology. Volume 33:Issue 2(2022)
- Journal:
- Journal of the American Society of Nephrology
- Issue:
- Volume 33:Issue 2(2022)
- Issue Display:
- Volume 33, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 33
- Issue:
- 2
- Issue Sort Value:
- 2022-0033-0002-0000
- Page Start:
- 375
- Page End:
- 386
- Publication Date:
- 2022-02
- Subjects:
- metabolomics -- pediatric nephrology -- chronic kidney disease -- machine learning -- machine learning collection
- DOI:
- 10.1681/ASN.2021040538 ↗
- Languages:
- English
- ISSNs:
- 1046-6673
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 26566.xml