A novel model to optimize multiple imputation algorithm for missing data using evolution methods. (July 2022)
- Record Type:
- Journal Article
- Title:
- A novel model to optimize multiple imputation algorithm for missing data using evolution methods. (July 2022)
- Main Title:
- A novel model to optimize multiple imputation algorithm for missing data using evolution methods
- Authors:
- Mohammed, Yasser Salaheldin
Abdelkader, Hatem
Pławiak, Paweł
Hammad, Mohamed - Abstract:
- Highlights: Propose a novel approach for imputing MD by fully optimizing the MICE algorithm. Propose a new method that works with all types of MD as MAR, MNAR and MCAR. Obtaining the best fitness values for MD from patients by combining MI with GA. Yielded 92.72 as the utmost fitness value with MAE after our second optimization. Abstract: The concept of missing data is considered significant when applying statistical methods to a dataset and the quality of the data analysis results is based on the correct data completeness. As a result, improving missing data filling processes is vital in order to give more reliable data throughout the phase of analysis. Here, we present a novel method for optimizing multiple regression imputation processes and obtaining the best fitness values for missing data from patients by combining multiple imputations with a genetic algorithm. To train and assess our proposed method, we employed 583 patient records from a publicly available database, divided into 416 records of liver patients and 167 records of the non-liver patients. The proposed approach offers the largest improvement for missing data findings, according to the results. Instead of employing the normal equation in multiple imputations, which yielded 92.72 as the utmost fitness value with Mean Absolute Error (MAE) 0.5877 from 1.1840 after our second optimization, we were able to achieve a fitness value of 233. The proposed approach might be tested using a large database and used inHighlights: Propose a novel approach for imputing MD by fully optimizing the MICE algorithm. Propose a new method that works with all types of MD as MAR, MNAR and MCAR. Obtaining the best fitness values for MD from patients by combining MI with GA. Yielded 92.72 as the utmost fitness value with MAE after our second optimization. Abstract: The concept of missing data is considered significant when applying statistical methods to a dataset and the quality of the data analysis results is based on the correct data completeness. As a result, improving missing data filling processes is vital in order to give more reliable data throughout the phase of analysis. Here, we present a novel method for optimizing multiple regression imputation processes and obtaining the best fitness values for missing data from patients by combining multiple imputations with a genetic algorithm. To train and assess our proposed method, we employed 583 patient records from a publicly available database, divided into 416 records of liver patients and 167 records of the non-liver patients. The proposed approach offers the largest improvement for missing data findings, according to the results. Instead of employing the normal equation in multiple imputations, which yielded 92.72 as the utmost fitness value with Mean Absolute Error (MAE) 0.5877 from 1.1840 after our second optimization, we were able to achieve a fitness value of 233. The proposed approach might be tested using a large database and used in Hepatocellular carcinoma (HCC) labs to help clinicians make accurate diagnoses. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 76(2022)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 76(2022)
- Issue Display:
- Volume 76, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 76
- Issue:
- 2022
- Issue Sort Value:
- 2022-0076-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07
- Subjects:
- Multiple imputation -- Fitness value -- Multiple regression -- Genetic algorithm -- Missing data -- Optimization
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.103661 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21539.xml