A machine learning based data modeling for medical diagnosis. (March 2023)
- Record Type:
- Journal Article
- Title:
- A machine learning based data modeling for medical diagnosis. (March 2023)
- Main Title:
- A machine learning based data modeling for medical diagnosis
- Authors:
- Mahoto, Naeem Ahmed
Shaikh, Asadullah
Sulaiman, Adel
Reshan, Mana Saleh Al
Rajab, Adel
Rajab, Khairan - Abstract:
- Abstract: High-dimensional medical data makes prediction a complex and difficult task. This study aims at modeling predictive models for medical data. Two datasets of medical data are applied in the study — one online available dataset (Heart Disease data) and another real clinical dataset (Eye Infection Data). A wide range of machine learning algorithms are applied in the modeling stage: Decision Tree, Multilayer Perceptron, Naive Bayesian, Random Forest, and Support Vector Machine. Furthermore, bagging and voting ensemble methods have also been applied with base learners. Both split and cross-validation methods are adopted for the model validation, and well-established evaluation metrics such as accuracy, precision, recall, and F-measure have been considered as evaluation metrics for the predictive models. The method applied for the modeling is comprised of two stages. The first stage uses available features for the predictions. In the second stage, selected features based on positive correlation are used. The adopted method is also for deep learning, especially Convolutional Neural Network (CNN) is applied to analyze the outcomes compared to conventional machine learning algorithms. The experimental results reveal that better predictions are achieved in the second stage. Besides, experiments also indicate split percentage produces better predictive models, and marginally better outcomes are observed in the presence of ensemble methods in comparison with base models. NBAbstract: High-dimensional medical data makes prediction a complex and difficult task. This study aims at modeling predictive models for medical data. Two datasets of medical data are applied in the study — one online available dataset (Heart Disease data) and another real clinical dataset (Eye Infection Data). A wide range of machine learning algorithms are applied in the modeling stage: Decision Tree, Multilayer Perceptron, Naive Bayesian, Random Forest, and Support Vector Machine. Furthermore, bagging and voting ensemble methods have also been applied with base learners. Both split and cross-validation methods are adopted for the model validation, and well-established evaluation metrics such as accuracy, precision, recall, and F-measure have been considered as evaluation metrics for the predictive models. The method applied for the modeling is comprised of two stages. The first stage uses available features for the predictions. In the second stage, selected features based on positive correlation are used. The adopted method is also for deep learning, especially Convolutional Neural Network (CNN) is applied to analyze the outcomes compared to conventional machine learning algorithms. The experimental results reveal that better predictions are achieved in the second stage. Besides, experiments also indicate split percentage produces better predictive models, and marginally better outcomes are observed in the presence of ensemble methods in comparison with base models. NB outperformed other algorithms with the highest accuracy rate as 88.90%, and MLP obtained 97.50% accuracy for Heart Disease and Eye Infection data, respectively, using 80–20 splits in the second stage. However, the CNN model performed poorly due to the size of the considered datasets. Highlights: Two medical datasets are applied in the study: one online available and the other real. Machine learning algorithms including DT, MLP, NB and RF are applied in the modeling. The method applied for the modeling is comprised of two stages. The experimental results reveal that better predictions are achieved in the second stage. NB outperformed for an online dataset with 88.90% accuracy and MLP with 97.50% for real dataset. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 81(2023)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 81(2023)
- Issue Display:
- Volume 81, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 81
- Issue:
- 2023
- Issue Sort Value:
- 2023-0081-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-03
- Subjects:
- Machine learning -- Medical data -- Classification -- Predictive models
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.104481 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25985.xml