Development of an interpretable machine learning model associated with heavy metals' exposure to identify coronary heart disease among US adults via SHAP: Findings of the US NHANES from 2003 to 2018. (January 2023)
- Record Type:
- Journal Article
- Title:
- Development of an interpretable machine learning model associated with heavy metals' exposure to identify coronary heart disease among US adults via SHAP: Findings of the US NHANES from 2003 to 2018. (January 2023)
- Main Title:
- Development of an interpretable machine learning model associated with heavy metals' exposure to identify coronary heart disease among US adults via SHAP: Findings of the US NHANES from 2003 to 2018
- Authors:
- Li, Xi
Zhao, Yang
Zhang, Dongdong
Kuang, Lei
Huang, Hao
Chen, Weiling
Fu, Xueru
Wu, Yuying
Li, Tianze
Zhang, Jinli
Yuan, Lijun
Hu, Huifang
Liu, Yu
Zhang, Ming
Hu, Fulan
Sun, Xizhuo
Hu, Dongsheng - Abstract:
- Abstract: Limited information is available on the links between heavy metals' exposure and coronary heart disease (CHD). We aim to establish an efficient and explainable machine learning (ML) model that associates heavy metals' exposure with CHD identification. Our datasets for investigating the associations between heavy metals and CHD were sourced from the US National Health and Nutrition Examination Survey (US NHANES, 2003–2018). Five ML models were established to identify CHD by heavy metals' exposure. Further, 11 discrimination characteristics were used to test the strength of the models. The optimally performing model was selected for identification. Finally, the SHapley Additive exPlanations (SHAP) tool was used for interpreting the features to visualize the selected model's decision-making capacity. In total, 12, 554 participants were eligible for this study. The best performing random forest classifier (RF) based on 13 heavy metals to identify CHD was chosen (AUC: 0.827; 95%CI: 0.777–0.877; accuracy: 95.9%). SHAP values indicated that cesium (1.62), thallium (1.17), antimony (1.63), dimethylarsonic acid (0.91), barium (0.76), arsenous acid (0.79), total arsenic (0.01) in urine, and lead (3.58) and cadmium (4.66) in blood positively contributed to the model, while cobalt (−0.15), cadmium (−2.93), and uranium (−0.13) in urine negatively contributed to the model. The RF model was efficient, accurate, and robust in identifying an association between heavy metals'Abstract: Limited information is available on the links between heavy metals' exposure and coronary heart disease (CHD). We aim to establish an efficient and explainable machine learning (ML) model that associates heavy metals' exposure with CHD identification. Our datasets for investigating the associations between heavy metals and CHD were sourced from the US National Health and Nutrition Examination Survey (US NHANES, 2003–2018). Five ML models were established to identify CHD by heavy metals' exposure. Further, 11 discrimination characteristics were used to test the strength of the models. The optimally performing model was selected for identification. Finally, the SHapley Additive exPlanations (SHAP) tool was used for interpreting the features to visualize the selected model's decision-making capacity. In total, 12, 554 participants were eligible for this study. The best performing random forest classifier (RF) based on 13 heavy metals to identify CHD was chosen (AUC: 0.827; 95%CI: 0.777–0.877; accuracy: 95.9%). SHAP values indicated that cesium (1.62), thallium (1.17), antimony (1.63), dimethylarsonic acid (0.91), barium (0.76), arsenous acid (0.79), total arsenic (0.01) in urine, and lead (3.58) and cadmium (4.66) in blood positively contributed to the model, while cobalt (−0.15), cadmium (−2.93), and uranium (−0.13) in urine negatively contributed to the model. The RF model was efficient, accurate, and robust in identifying an association between heavy metals' exposure and CHD among US NHANES 2003–2018 participants. Cesium, thallium, antimony, dimethylarsonic acid, barium, arsenous acid, and total arsenic in urine, and lead and cadmium in blood show positive relationships with CHD, while cobalt, cadmium, and uranium in urine show negative relationships with CHD. Graphical abstract: Image 1 Highlights: An explainable ML model for CHD identification was built by using multi-source data over 16 years. RF model associated with heavy metal for identifying CHD was efficient, accurate, and robust. Shap explained positive and negative relationships between heavy metals with CHD. … (more)
- Is Part Of:
- Chemosphere. Volume 311:Part 1(2023)
- Journal:
- Chemosphere
- Issue:
- Volume 311:Part 1(2023)
- Issue Display:
- Volume 311, Issue 1, Part 1 (2023)
- Year:
- 2023
- Volume:
- 311
- Issue:
- 1
- Part:
- 1
- Issue Sort Value:
- 2023-0311-0001-0001
- Page Start:
- Page End:
- Publication Date:
- 2023-01
- Subjects:
- Coronary heart disease -- Heavy metals -- Machine learning -- SHAP
Pollution -- Periodicals
Pollution -- Physiological effect -- Periodicals
Environmental sciences -- Periodicals
Atmospheric chemistry -- Periodicals
551.511 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00456535/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.chemosphere.2022.137039 ↗
- Languages:
- English
- ISSNs:
- 0045-6535
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3172.280000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 24413.xml