A novel weighted distance threshold method for handling medical missing values. (July 2020)
- Record Type:
- Journal Article
- Title:
- A novel weighted distance threshold method for handling medical missing values. (July 2020)
- Main Title:
- A novel weighted distance threshold method for handling medical missing values
- Authors:
- Cheng, Ching-Hsue
Chang, Jing-Rong
Huang, Hao-Hsuan - Abstract:
- Abstract: Data in the medical field often contain missing values and may result in biased research results. Therefore, the objective of this work is to propose a new imputation method, a novel weighted distance threshold method, to impute missing values. After several experiments, we find that the proposed imputation method has the following benefits. (1) The proposed method with purity can reassign instances into the nearest class of the dataset, and the purity computation can filter outliers; (2) The proposed method redefines the degree of missing values and can determine attributes and instances relative to the missing values in different datasets; and (3) The proposed method need not set the k value of the nearest neighborhood because this study identifies the k value based on the best threshold to calculate purity to enhance the results of imputation. In addition, the distance threshold can adjust the optimal nearest neighborhood to estimate missing values. This study implements several experiments to compare the proposed method with other imputation methods using different missing types, missing degrees, and types of datasets. The results indicate that the proposed imputation method is better than the listed methods. Moreover, this study uses the stroke dataset from the International Stroke Trial (IST) to verify whether the proposed method can be effectively applied in practice, and the results show that the proposed method achieves 90% accuracy in the Stroke dataset.
- Is Part Of:
- Computers in biology and medicine. Volume 122(2020)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 122(2020)
- Issue Display:
- Volume 122, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 122
- Issue:
- 2020
- Issue Sort Value:
- 2020-0122-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-07
- Subjects:
- Stroke disease -- Missing values -- Imputation technique -- Distance threshold
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2020.103824 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13404.xml