An accurate and explainable ensemble learning method for carotid plaque prediction in an asymptomatic population. (June 2022)
- Record Type:
- Journal Article
- Title:
- An accurate and explainable ensemble learning method for carotid plaque prediction in an asymptomatic population. (June 2022)
- Main Title:
- An accurate and explainable ensemble learning method for carotid plaque prediction in an asymptomatic population
- Authors:
- Wu, Dan
Cui, Guosheng
Huang, Xiaoxiang
Chen, Yining
Liu, Guanzheng
Ren, Lijie
Li, Ye - Abstract:
- Highlights: As far as we know, we are the first to assess the carotid plaque occurrence using a machine learning method. High accuracy is achieved in carotid plaque prediction, even when data are missing. The underlying causal reasoning for the machine decisions is explainable, enabling subclinical interventions.The proposed method is a promising tool for large-scale screening of stroke. Abstract: Background and objective: The identification of carotid plaque, one of the most crucial tasks in stroke screening, is of great significance in the assessment of subclinical atherosclerosis and preventing the onset of stroke. However, traditional ultrasound examination is not prevalent or cost-effective for asymptomatic people, particularly low-income individuals in rural areas. Thus, it is necessary to develop an accurate and explainable model for early identification of the risk of plaque prevalence that can help in the primary prevention of stroke. Methods: We developed an ensemble learning method to predict the occurrence of carotid plaques. A dataset comprising 1440 subjects (50% with plaques and 50% without plaques) and ten-fold cross-validation were utilized to evaluate the model performance. Four machine learning methods (extreme gradient boosting (XGBoost), gradient boosting decision tree, random forest, and support vector machine) were evaluated. Subsequently, the interpretability of the XGBoost model, which provided the best performance, was analyzed from three aspects:Highlights: As far as we know, we are the first to assess the carotid plaque occurrence using a machine learning method. High accuracy is achieved in carotid plaque prediction, even when data are missing. The underlying causal reasoning for the machine decisions is explainable, enabling subclinical interventions.The proposed method is a promising tool for large-scale screening of stroke. Abstract: Background and objective: The identification of carotid plaque, one of the most crucial tasks in stroke screening, is of great significance in the assessment of subclinical atherosclerosis and preventing the onset of stroke. However, traditional ultrasound examination is not prevalent or cost-effective for asymptomatic people, particularly low-income individuals in rural areas. Thus, it is necessary to develop an accurate and explainable model for early identification of the risk of plaque prevalence that can help in the primary prevention of stroke. Methods: We developed an ensemble learning method to predict the occurrence of carotid plaques. A dataset comprising 1440 subjects (50% with plaques and 50% without plaques) and ten-fold cross-validation were utilized to evaluate the model performance. Four machine learning methods (extreme gradient boosting (XGBoost), gradient boosting decision tree, random forest, and support vector machine) were evaluated. Subsequently, the interpretability of the XGBoost model, which provided the best performance, was analyzed from three aspects: feature importance, feature effect on prediction model, and feature effect on prediction decision for a specific subject. Results: The XGBoost algorithm provided the best performance (sensitivity: 0.8678, specificity: 0.8592, accuracy: 0.8632, F1 score: 0.8621, area under the curve: 0.8635) in carotid plaque prediction and also had excellent performance under missing data circumstances. Further, interpretability analysis showed that the decisions of the XGBoost model were highly congruent with clinical knowledge. Conclusion: The model results are superior to those of state-of-the-art methods. Thus, it is a promising carotid plaque prediction tool that could be used in the primary prevention of stroke. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 221(2022)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 221(2022)
- Issue Display:
- Volume 221, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 221
- Issue:
- 2022
- Issue Sort Value:
- 2022-0221-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06
- Subjects:
- Machine learning -- Explainable model -- Carotid plaque -- Prediction -- Primary prevention of stroke
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2022.106842 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22100.xml