A data science approach for early-stage prediction of Patient's susceptibility to acute side effects of advanced radiotherapy. (August 2021)
- Record Type:
- Journal Article
- Title:
- A data science approach for early-stage prediction of Patient's susceptibility to acute side effects of advanced radiotherapy. (August 2021)
- Main Title:
- A data science approach for early-stage prediction of Patient's susceptibility to acute side effects of advanced radiotherapy
- Authors:
- Aldraimli, Mahmoud
Soria, Daniele
Grishchuck, Diana
Ingram, Samuel
Lyon, Robert
Mistry, Anil
Oliveira, Jorge
Samuel, Robert
Shelley, Leila E.A.
Osman, Sarah
Dwek, Miriam V.
Azria, David
Chang-Claude, Jenny
Gutiérrez-Enríquez, Sara
De Santis, Maria Carmen
Rosenstein, Barry S.
De Ruysscher, Dirk
Sperk, Elena
Symonds, R. Paul
Stobart, Hilary
Vega, Ana
Veldeman, Liv
Webb, Adam
Talbot, Christopher J.
West, Catharine M.
Rattay, Tim
Chaussalet, Thierry J. - Abstract:
- Abstract: The prediction by classification of side effects incidence in a given medical treatment is a common challenge in medical research. Machine Learning (ML) methods are widely used in the areas of risk prediction and classification. The primary objective of such algorithms is to use several features to predict dichotomous responses (e.g., disease positive/negative). Similar to statistical inference modelling, ML modelling is subject to the class imbalance problem and is affected by the majority class, increasing the false-negative rate. In this study, seventy-nine ML models were built and evaluated to classify approximately 2000 participants from 26 hospitals in eight different countries into two groups of radiotherapy (RT) side effects incidence based on recorded observations from the international study of RT related toxicity "REQUITE". We also examined the effect of sampling techniques and cost-sensitive learning methods on the models when dealing with class imbalance. The combinations of such techniques used had a significant impact on the classification. They resulted in an improvement in incidence status prediction by shifting classifiers' attention to the minority group. The best classification model for RT acute toxicity prediction was identified based on domain experts' success criteria. The Area Under Receiver Operator Characteristic curve of the models tested with an isolated dataset ranged from 0.50 to 0.77. The scale of improved results is promising andAbstract: The prediction by classification of side effects incidence in a given medical treatment is a common challenge in medical research. Machine Learning (ML) methods are widely used in the areas of risk prediction and classification. The primary objective of such algorithms is to use several features to predict dichotomous responses (e.g., disease positive/negative). Similar to statistical inference modelling, ML modelling is subject to the class imbalance problem and is affected by the majority class, increasing the false-negative rate. In this study, seventy-nine ML models were built and evaluated to classify approximately 2000 participants from 26 hospitals in eight different countries into two groups of radiotherapy (RT) side effects incidence based on recorded observations from the international study of RT related toxicity "REQUITE". We also examined the effect of sampling techniques and cost-sensitive learning methods on the models when dealing with class imbalance. The combinations of such techniques used had a significant impact on the classification. They resulted in an improvement in incidence status prediction by shifting classifiers' attention to the minority group. The best classification model for RT acute toxicity prediction was identified based on domain experts' success criteria. The Area Under Receiver Operator Characteristic curve of the models tested with an isolated dataset ranged from 0.50 to 0.77. The scale of improved results is promising and will guide further development of models to predict RT acute toxicities. One model was optimised and found to be beneficial to identify patients who are at risk of developing acute RT early-stage toxicities as a result of undergoing breast RT ensuring relevant treatment interventions can be appropriately targeted. The design of the approach presented in this paper resulted in producing a preclinical-valid prediction model. The study was developed by a multi-disciplinary collaboration of data scientists, medical physicists, oncologists and surgeons in the UK Radiotherapy Machine Learning Network. Highlights: Acute toxicities can be predicted with machine learning. The machine learning models were validated by an external (isolated) cohort. Modelling clinical data only offers an inexpensive prediction of acute toxicities. This is the first study to use the clinical features only at a CTCAE >3 setting. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 135(2021)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 135(2021)
- Issue Display:
- Volume 135, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 135
- Issue:
- 2021
- Issue Sort Value:
- 2021-0135-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-08
- Subjects:
- Classification -- REQUITE -- Machine learning -- Imbalanced learning -- Radiotherapy -- SMOTE -- Meta-learning -- Desquamation -- Early toxicities
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2021.104624 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18878.xml