Predicting cervical cancer biopsy results using demographic and epidemiological parameters: a custom stacked ensemble machine learning approach. Issue 1 (31st December 2022)
- Record Type:
- Journal Article
- Title:
- Predicting cervical cancer biopsy results using demographic and epidemiological parameters: a custom stacked ensemble machine learning approach. Issue 1 (31st December 2022)
- Main Title:
- Predicting cervical cancer biopsy results using demographic and epidemiological parameters: a custom stacked ensemble machine learning approach
- Authors:
- Chadaga, Krishnaraj
Prabhu, Srikanth
Sampathila, Niranjana
Chadaga, Rajagopala
K S, Swathi
Sengupta, Saptarshi - Abstract:
- Abstract: The human papillomavirus (HPV) is responsible for most cervical cancer cases worldwide. This gynecological carcinoma causes many deaths, even though it can be treated by removing malignant tissues at a preliminary stage. In many developing countries, patients do not undertake medical examinations due to the lack of awareness, hospital resources and high testing costs. Hence, it is vital to design a computer aided diagnostic method which can screen cervical cancer patients. In this research, we predict the probability risk of contracting this deadly disease using a custom stacked ensemble machine learning approach. The technique combines the results of several machine learning algorithms on multiple levels to produce reliable predictions. In the beginning, a deep exploratory analysis is conducted using univariate and multivariate statistics. Later, the one-way ANOVA, mutual information and Pearson's correlation techniques are utilized for feature selection. Since the data was imbalanced, the Borderline-SMOTE technique was used to balance the data. The final stacked machine learning model obtained an accuracy, precision, recall, F1-score, area under curve (AUC) and average precision of 98%, 97%, 99%, 98%, 100% and 100%, respectively. To make the model explainable and interpretable to clinicians, explainable artificial intelligence algorithms such as Shapley additive values (SHAP), local interpretable model agnostic explanation (LIME), random forest and ELI5 have beenAbstract: The human papillomavirus (HPV) is responsible for most cervical cancer cases worldwide. This gynecological carcinoma causes many deaths, even though it can be treated by removing malignant tissues at a preliminary stage. In many developing countries, patients do not undertake medical examinations due to the lack of awareness, hospital resources and high testing costs. Hence, it is vital to design a computer aided diagnostic method which can screen cervical cancer patients. In this research, we predict the probability risk of contracting this deadly disease using a custom stacked ensemble machine learning approach. The technique combines the results of several machine learning algorithms on multiple levels to produce reliable predictions. In the beginning, a deep exploratory analysis is conducted using univariate and multivariate statistics. Later, the one-way ANOVA, mutual information and Pearson's correlation techniques are utilized for feature selection. Since the data was imbalanced, the Borderline-SMOTE technique was used to balance the data. The final stacked machine learning model obtained an accuracy, precision, recall, F1-score, area under curve (AUC) and average precision of 98%, 97%, 99%, 98%, 100% and 100%, respectively. To make the model explainable and interpretable to clinicians, explainable artificial intelligence algorithms such as Shapley additive values (SHAP), local interpretable model agnostic explanation (LIME), random forest and ELI5 have been effectively utilized. The optimistic results indicate the potential of automated frameworks to assist doctors and medical professionals in diagnosing and screening potential cervical cancer patients. … (more)
- Is Part Of:
- Cogent engineering. Volume 9:Issue 1(2022)
- Journal:
- Cogent engineering
- Issue:
- Volume 9:Issue 1(2022)
- Issue Display:
- Volume 9, Issue 1 (2022)
- Year:
- 2022
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2022-0009-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12-31
- Subjects:
- cervical cancer -- diagnosis -- Explainable Artificial Intelligence -- machine learning -- ANOVA -- stacking
Engineering -- Periodicals
Technology -- Periodicals
Engineering
Technology
Periodicals
620 - Journal URLs:
- http://bibpurl.oclc.org/web/73324 ↗
http://cogentoa.tandfonline.com/journal/oaen20 ↗
http://www.tandfonline.com/toc/oaen20/1/1 ↗
http://www.tandfonline.com/ ↗
http://cogentoa.tandfonline.com/journal/oaps20 ↗ - DOI:
- 10.1080/23311916.2022.2143040 ↗
- Languages:
- English
- ISSNs:
- 2331-1916
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24494.xml