A machine learning-based diagnosis modelling of type 2 diabetes mellitus with environmental metal exposure. (June 2023)
- Record Type:
- Journal Article
- Title:
- A machine learning-based diagnosis modelling of type 2 diabetes mellitus with environmental metal exposure. (June 2023)
- Main Title:
- A machine learning-based diagnosis modelling of type 2 diabetes mellitus with environmental metal exposure
- Authors:
- Zhao, Min
Wan, Jin
Qin, Wenzhi
Huang, Xin
Chen, Guangdi
Zhao, Xinyuan - Abstract:
- Highlights: A novel feature selection that entailed both feature importance and multicollinearity concerns was proposed to identify potential factors for the diagnosis of type 2 diabetes mellitus. Compared with the AUC values of classifiers in the most related researches, the AUC value of the novel ensemble classifier was raised to 0.85. The potential values of urinary and dietary metal exposure to the diagnosis of type 2 diabetes mellitus have been further determined. Abstract: Background and Objective: Increasing and compelling evidence has been proved that urinary and dietary metal exposure are underappreciated but potentially modifiable biomarkers for type 2 diabetes mellitus (T2DM). The aims of this study were (1) to identify the key potential biomarkers which contributed to T2DM with effective and parsimonious features and (2) to assess the utility of baseline variables and metal exposure in the diagnosis of T2DM. Methods: Based on the National Health and Nutrition Examination Survey (NHANES), we selected 9822 screening records with 82 significant variables covering demographics, lifestyle, anthropometric measures, diet and metal exposure for this study. Combining extreme gradient boosting (XGBoost), random forest and light gradient boosting machine (lightGBM), a soft voting ensemble model was proposed to measure the importance of 82 features. With this soft voting ensemble model and variance inflation factor (VIF), strong multicollinear features with low importanceHighlights: A novel feature selection that entailed both feature importance and multicollinearity concerns was proposed to identify potential factors for the diagnosis of type 2 diabetes mellitus. Compared with the AUC values of classifiers in the most related researches, the AUC value of the novel ensemble classifier was raised to 0.85. The potential values of urinary and dietary metal exposure to the diagnosis of type 2 diabetes mellitus have been further determined. Abstract: Background and Objective: Increasing and compelling evidence has been proved that urinary and dietary metal exposure are underappreciated but potentially modifiable biomarkers for type 2 diabetes mellitus (T2DM). The aims of this study were (1) to identify the key potential biomarkers which contributed to T2DM with effective and parsimonious features and (2) to assess the utility of baseline variables and metal exposure in the diagnosis of T2DM. Methods: Based on the National Health and Nutrition Examination Survey (NHANES), we selected 9822 screening records with 82 significant variables covering demographics, lifestyle, anthropometric measures, diet and metal exposure for this study. Combining extreme gradient boosting (XGBoost), random forest and light gradient boosting machine (lightGBM), a soft voting ensemble model was proposed to measure the importance of 82 features. With this soft voting ensemble model and variance inflation factor (VIF), strong multicollinear features with low importance scores were further removed from candidate biomarkers. Then, a soft voting ensemble classifier was adopted to demonstrate the efficiency of the proposed feature selection method. Results: With the novel feature selection method, 12 baseline variables and 3 metal variables were selected to detect patients at risk for T2DM in our study. For metal variables, the dietary copper (Cu), urinary cadmium (Cd) and urinary mercury (Hg) metals were selected as the most remarkable metal exposure and the corresponding P-values were all less than 0.05. In a classification model of T2DM with 12 baseline biomarkers, the addition of 3 metal exposure improved the classification accuracy of T2DM from a traditional area under the curve (AUC) 0.792 of the receiver operating characteristic (ROC) to an AUC 0.847. Conclusions: This was the first demonstration of T2DM classification with machine learning under urinary and dietary metal exposure. Improved prediction precision illustrated the effectiveness of the proposed machine learning-based diagnosis model facilitated lifestyle/dietary intervention for T2DM prevention. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 235(2023)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 235(2023)
- Issue Display:
- Volume 235, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 235
- Issue:
- 2023
- Issue Sort Value:
- 2023-0235-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-06
- Subjects:
- Machine learning -- Environmental metal exposure -- Statistical analysis -- Type 2 diabetes mellitus
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2023.107537 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 27095.xml