New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. (August 2020)
- Record Type:
- Journal Article
- Title:
- New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. (August 2020)
- Main Title:
- New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers
- Authors:
- Nalić, Jasmina
Martinović, Goran
Žagar, Drago - Abstract:
- Abstract: The aim of this paper is to propose a new hybrid data mining model based on combination of various feature selection and ensemble learning classification algorithms, in order to support decision making process. The model is built through several stages. In the first stage, initial dataset is preprocessed and apart of applying different preprocessing techniques, we paid a great attention to the feature selection. Five different feature selection algorithms were applied and their results, based on ROC and accuracy measures of logistic regression algorithm, were combined based on different voting types. We also proposed a new voting method, called if_any, that outperformed all other voting methods, as well as a single feature selection algorithm's results. In the next stage, a four different classification algorithms, including generalized linear model, support vector machine, naive Bayes and decision tree, were performed based on dataset obtained in the feature selection process. These classifiers were combined in eight different ensemble models using soft voting method. Using the real dataset, the experimental results show that hybrid model that is based on features selected by if_any voting method and ensemble GLM + DT model performs the highest performance and outperforms all other ensemble and single classifier models.
- Is Part Of:
- Advanced engineering informatics. Volume 45(2020)
- Journal:
- Advanced engineering informatics
- Issue:
- Volume 45(2020)
- Issue Display:
- Volume 45, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 45
- Issue:
- 2020
- Issue Sort Value:
- 2020-0045-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08
- Subjects:
- Credit scoring -- Data mining -- Ensemble classifier -- Feature selection -- Hybrid model
Computer-aided engineering -- Periodicals
Engineering -- Data processing -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14740346 ↗
http://books.google.com/books?id=KhFVAAAAMAAJ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.aei.2020.101130 ↗
- Languages:
- English
- ISSNs:
- 1474-0346
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0696.851100
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 13568.xml