Artificial intelligence based personalized predictive survival among colorectal cancer patients. (April 2023)
- Record Type:
- Journal Article
- Title:
- Artificial intelligence based personalized predictive survival among colorectal cancer patients. (April 2023)
- Main Title:
- Artificial intelligence based personalized predictive survival among colorectal cancer patients
- Authors:
- Susič, David
Syed-Abdul, Shabbir
Dovgan, Erik
Jonnagaddala, Jitendra
Gradišek, Anton
- Abstract:
- Highlights: Eleven machine learning models were evaluated for short- and long-term survival prediction of colorectal cancer patients using the Molecular and Cellular Oncology dataset. The 20 most important predictor variables for prediction of both long- and short-term colorectal cancer survival were identified. The best performing model was logistic regression, achieving an area under the receiver operating characteristic curve of 0.850 (0.014 SD, 0.840-0.860 95% CI) for the 1-year, and 0.872 (0.014 SD, 0.861-0.882 95% CI) for the 5-year survival prediction. For all 5 (1- to 5-year) survival intervals, the probability of the top model agrees with the Kaplan-Meier estimate, both in the interval of one standard deviation and in the 95% confidence interval.
Abstract: Background and Objective: Colorectal cancer is a major health concern. It is now the third most common cancer and the fourth leading cause of cancer mortality worldwide. The aim of this study was to evaluate the performance of machine learning algorithms for predicting survival of colorectal cancer patients 1 to 5 years after diagnosis, and to identify the most important variables.
Methods: A sample of 1236 patients diagnosed with colorectal cancer and 118 predictor variables was used. The outcome of interest was a binary variable indicating whether the patient survived the number of years in question or not. Twenty predictor variables were selected using the mutual information score with the outcome. We implemented 11 machine learning algorithms and evaluated their performance with a 5 by 2-fold cross-validation with stratified folds and with paired Student's t-tests. We compared the results with the Kaplan-Meier estimator and Cox's proportional hazard regression.
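The Kaplan-Meier estimator named in the Methods as a comparison baseline can be sketched in a few lines. This is a minimal illustration only, not the authors' code: the function name `kaplan_meier` and the toy follow-up data are hypothetical and were made up for the example, not taken from the study.

```python
from collections import Counter


def kaplan_meier(durations, events):
    """Return [(t, S(t))] at each distinct time with an observed death.

    durations: follow-up time for each patient (hypothetical units: years)
    events:    1 if death was observed at that time, 0 if censored
    """
    deaths = Counter(t for t, e in zip(durations, events) if e)
    exits = Counter(durations)  # every patient leaves the risk set at their own time
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:
            # Product-limit step: multiply by the fraction surviving time t.
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]  # remove deaths and censored patients alike
    return curve


# Toy data, invented for illustration (not from the study).
durations = [1, 2, 2, 3, 4, 5, 5, 6]
events = [1, 1, 0, 1, 0, 1, 1, 0]
curve = kaplan_meier(durations, events)
for t, s in curve:
    print(f"S({t}) = {s:.3f}")
```

Reading the curve at t = 1 to 5 gives the kind of per-year survival probability that the abstract compares against the model output.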
Results: Using the 20 most important predictor variables for each of the survival years, the logistic regression algorithm achieved an area under the receiver operating characteristic curve of 0.850 (0.014 SD, 0.840-0.860 95% CI) for the 1-year, and 0.872 (0.014 SD, 0.861-0.882 95% CI) for the 5-year survival prediction. Using only the 5 most important predictor variables, the corresponding values are 0.793 (0.020 SD, 0.778-0.807 95% CI) and 0.794 (0.011 SD, 0.785-0.802 95% CI). The most important variables for 1-year prediction were number of R residual, M distant metastasis, overall stage, probable recurrence within 5 years, and tumour length, whereas for 5-year prediction the most important were probable recurrence within 5 years, R residual, M distant metastasis, number of positive lymph nodes, and palliative chemotherapy. Biomarkers do not appear among the top 20 most important variables. For all survival intervals, the probability of the top model agrees with the Kaplan-Meier estimate, both in the interval of one standard deviation and in the 95% confidence interval.
Conclusions: The findings suggest that machine learning algorithms can predict the survival probability of colorectal cancer patients and can be used to inform patients and assist decision-making in clinical care management. In addition, this study identifies the most essential variables for estimating short- and long-term survival among patients with colorectal cancer.
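The abstract does not say how the 95% confidence intervals were derived from the reported means and standard deviations. The sketch below assumes a Student's t interval over the 10 scores that a 5 by 2-fold cross-validation produces (5 repetitions times 2 folds). That assumption, the helper name `t_interval`, and the hard-coded critical value are ours, not the authors'. Under it, the 1-year figure is reproduced exactly, and the other three reported intervals agree to within rounding of the published mean and SD.

```python
import math

# Two-sided 95% critical value of Student's t with 9 degrees of freedom
# (10 cross-validation scores minus 1). Assumed, not stated in the abstract.
T_CRIT_DF9 = 2.262


def t_interval(mean, sd, n=10, t_crit=T_CRIT_DF9):
    """Confidence interval for a mean from its sample SD over n scores."""
    half_width = t_crit * sd / math.sqrt(n)
    return mean - half_width, mean + half_width


# Reported 1-year AUC for logistic regression: 0.850 (0.014 SD).
low, high = t_interval(0.850, 0.014)
print(f"{low:.3f}-{high:.3f}")  # prints 0.840-0.860
```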
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 231(2023)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 231(2023)
- Issue Display:
- Volume 231, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 231
- Issue:
- 2023
- Issue Sort Value:
- 2023-0231-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-04
- Subjects:
- Survival Prediction -- Colorectal Cancer -- Cancer Survival -- Machine Learning
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28
- Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.cmpb.2023.107435
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26140.xml