Prediction of Driver Gene Matching in Lung Cancer NOG/PDX Models Based on Artificial Intelligence. (August 2022)
- Record Type:
- Journal Article
- Title:
- Prediction of Driver Gene Matching in Lung Cancer NOG/PDX Models Based on Artificial Intelligence. (August 2022)
- Main Title:
- Prediction of Driver Gene Matching in Lung Cancer NOG/PDX Models Based on Artificial Intelligence
- Authors:
- He, Yayi
Guo, Haoyue
Diao, Li
Chen, Yu
Zhu, Junjie
Fernando, Hiran C.
Rivas, Diego Gonzalez
Qi, Hui
Dai, Chunlei
Tang, Xuzhen
Zhu, Jun
Dai, Jiawei
He, Kan
Chan, Dan
Yang, Yang - Abstract:
- Abstract: Patient-derived tumor xenografts (PDXs) are a powerful tool for drug discovery and screening in cancer. However, current studies have led to little understanding of genotype mismatches in PDXs, leading to massive economic losses. Here, we established PDX models from 53 lung cancer patients with a genotype matching rate of 79.2% (42/53). Furthermore, 17 clinicopathological features were examined and input in stepwise logistic regression (LR) models based on the lowest Akaike information criterion (AIC), least absolute shrinkage and selection operator (LASSO)-LR, support vector machine (SVM) recursive feature elimination (SVM-RFE), extreme gradient boosting (XGBoost), gradient boosting and categorical features (CatBoost), and the synthetic minority oversampling technique (SMOTE). Finally, the performance of all models was evaluated by the accuracy, area under the receiver operating characteristic curve (AUC), and F1 score in 100 testing groups. Two multivariable LR models revealed that age, number of driver gene mutations, epidermal growth factor receptor ( EGFR ) gene mutations, type of prior chemotherapy, prior tyrosine kinase inhibitor (TKI) therapy, and the source of the sample were powerful predictors. Moreover, CatBoost (mean accuracy = 0.960; mean AUC = 0.939; mean F1 score = 0.908) and the eight-feature SVM-RFE (mean accuracy = 0.950; mean AUC = 0.934; mean F1 score = 0.903) showed the best performance among the algorithms. Meanwhile, application of the SMOTEAbstract: Patient-derived tumor xenografts (PDXs) are a powerful tool for drug discovery and screening in cancer. However, current studies have led to little understanding of genotype mismatches in PDXs, leading to massive economic losses. Here, we established PDX models from 53 lung cancer patients with a genotype matching rate of 79.2% (42/53). Furthermore, 17 clinicopathological features were examined and input in stepwise logistic regression (LR) models based on the lowest Akaike information criterion (AIC), least absolute shrinkage and selection operator (LASSO)-LR, support vector machine (SVM) recursive feature elimination (SVM-RFE), extreme gradient boosting (XGBoost), gradient boosting and categorical features (CatBoost), and the synthetic minority oversampling technique (SMOTE). Finally, the performance of all models was evaluated by the accuracy, area under the receiver operating characteristic curve (AUC), and F1 score in 100 testing groups. Two multivariable LR models revealed that age, number of driver gene mutations, epidermal growth factor receptor ( EGFR ) gene mutations, type of prior chemotherapy, prior tyrosine kinase inhibitor (TKI) therapy, and the source of the sample were powerful predictors. Moreover, CatBoost (mean accuracy = 0.960; mean AUC = 0.939; mean F1 score = 0.908) and the eight-feature SVM-RFE (mean accuracy = 0.950; mean AUC = 0.934; mean F1 score = 0.903) showed the best performance among the algorithms. Meanwhile, application of the SMOTE improved the predictive capability of most models, except CatBoost. Based on the SMOTE, the ensemble classifier of single models achieved the highest accuracy (mean = 0.975), AUC (mean = 0.949), and F1 score (mean = 0.938). In conclusion, we established an optimal predictive model to screen lung cancer patients for non-obese diabetic (NOD)/Shi-scid, interleukin-2 receptor (IL-2R) γ null (NOG)/PDX models and offer a general approach for building predictive models. … (more)
- Is Part Of:
- Engineering. Volume 15(2022)
- Journal:
- Engineering
- Issue:
- Volume 15(2022)
- Issue Display:
- Volume 15, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 15
- Issue:
- 2022
- Issue Sort Value:
- 2022-0015-2022-0000
- Page Start:
- 102
- Page End:
- 114
- Publication Date:
- 2022-08
- Subjects:
- Machine learning -- Patient-derived tumor xenografts -- NOG mice
Engineering -- Periodicals
Engineering -- China -- Periodicals
620.005 - Journal URLs:
- http://www.sciencedirect.com/science/journal/20958099 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.eng.2021.06.017 ↗
- Languages:
- English
- ISSNs:
- 2095-8099
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23872.xml