A systematic map of medical data preprocessing in knowledge discovery. (August 2018)
- Record Type:
- Journal Article
- Title:
- A systematic map of medical data preprocessing in knowledge discovery. (August 2018)
- Main Title:
- A systematic map of medical data preprocessing in knowledge discovery
- Authors:
- Idri, A.
Benhar, H.
Fernández-Alemán, J.L.
Kadi, I. - Abstract:
- Highlights: Perform a systematic map of 110 studies on the application of data preprocessing in healthcare. Analyze and classify the selected studies according to their publication years and channels, research type, empirical type and contribution type. Researchers have paid a considerable amount of attention to preprocessing in medical DM in last decade. A significant number of the selected studies used data reduction and cleaning preprocessing tasks. The discipline in which preprocessing has received most attention are: cardiology, endocrinology and oncology. Abstract: Background and objective: Datamining (DM) has, over the last decade, received increased attention in the medical domain and has been widely used to analyze medical datasets in order to extract useful knowledge and previously unknown patterns. However, historical medical data can often comprise inconsistent, noisy, imbalanced, missing and high dimensional data. These challenges lead to a serious bias in predictive modeling and reduce the performance of DM techniques. Data preprocessing is, therefore, an essential step in knowledge discovery as regards improving the quality of data and making it appropriate and suitable for DM techniques. The objective of this paper is to review the use of preprocessing techniques in clinical datasets. Methods: We performed a systematic map of studies regarding the application of data preprocessing to healthcare and published between January 2000 and December 2017. A searchHighlights: Perform a systematic map of 110 studies on the application of data preprocessing in healthcare. Analyze and classify the selected studies according to their publication years and channels, research type, empirical type and contribution type. Researchers have paid a considerable amount of attention to preprocessing in medical DM in last decade. A significant number of the selected studies used data reduction and cleaning preprocessing tasks. The discipline in which preprocessing has received most attention are: cardiology, endocrinology and oncology. Abstract: Background and objective: Datamining (DM) has, over the last decade, received increased attention in the medical domain and has been widely used to analyze medical datasets in order to extract useful knowledge and previously unknown patterns. However, historical medical data can often comprise inconsistent, noisy, imbalanced, missing and high dimensional data. These challenges lead to a serious bias in predictive modeling and reduce the performance of DM techniques. Data preprocessing is, therefore, an essential step in knowledge discovery as regards improving the quality of data and making it appropriate and suitable for DM techniques. The objective of this paper is to review the use of preprocessing techniques in clinical datasets. Methods: We performed a systematic map of studies regarding the application of data preprocessing to healthcare and published between January 2000 and December 2017. A search string was determined on the basis of the mapping questions and the PICO categories. The search string was then applied in digital databases covering the fields of computer science and medical informatics in order to identify relevant studies. The studies were initially selected by reading their titles, abstracts and keywords. Those that were selected at that stage were then reviewed using a set of inclusion and exclusion criteria in order to eliminate any that were not relevant. This process resulted in 126 primary studies. Results: Selected studies were analyzed and classified according to their publication years and channels, research type, empirical type and contribution type. The findings of this mapping study revealed that researchers have paid a considerable amount of attention to preprocessing in medical DM in last decade. A significant number of the selected studies used data reduction and cleaning preprocessing tasks. Moreover, the disciplines in which preprocessing have received most attention are: cardiology, endocrinology and oncology. Conclusions: Researchers should develop and implement standards for an effective integration of multiple medical data types. Moreover, we identified the need to perform literature reviews. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 162(2018)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 162(2018)
- Issue Display:
- Volume 162, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 162
- Issue:
- 2018
- Issue Sort Value:
- 2018-0162-2018-0000
- Page Start:
- 69
- Page End:
- 85
- Publication Date:
- 2018-08
- Subjects:
- Medical datamining -- Data preprocessing -- Clinical data -- Mapping study -- Electronic heath records
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2018.05.007 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6903.xml