A comparative study of combining tree‐based feature selection methods and classifiers in personal loan default prediction. (11th May 2022)
- Record Type:
- Journal Article
- Title:
- A comparative study of combining tree‐based feature selection methods and classifiers in personal loan default prediction. (11th May 2022)
- Main Title:
- A comparative study of combining tree‐based feature selection methods and classifiers in personal loan default prediction
- Authors:
- Guo, Weidong
Zhou, Zach Zhizhong - Abstract:
- Abstract: Personal credit data usually contain a large number of features, some of which do not significantly contribute to the performance of default prediction models. Screening features through appropriate methods is essential to improve the efficiency of prediction models. However, little attention has been paid to feature selection methods in the area of personal loan default prediction. In this study, we employ random forest (RF), XGBoost, Adaptive Boosting (AdaBoost), Categorical Boosting (CatBoost), and Light Gradient Boosting Machine (LightGBM) as base algorithms of wrapper and embedded methods to select features and use these algorithms as classifiers to predict personal loan default. We find that when classical filter methods are used to select features, the number of selected features needs to be large enough to enable tree‐based classifiers to get their best performance. However, when the tree‐based algorithm is used to select features, it only needs to select a small number of features to deliver a satisfactory classification performance. AdaBoost, Chi2, and F ‐score are found to be ideal feature selection methods in the area of personal credit default prediction. Moreover, we find that it is better to use different algorithms in feature selection and classification; AdaBoost and CatBoost perform the best among all classifiers.
- Is Part Of:
- Journal of forecasting. Volume 41:Number 6(2022)
- Journal:
- Journal of forecasting
- Issue:
- Volume 41:Number 6(2022)
- Issue Display:
- Volume 41, Issue 6 (2022)
- Year:
- 2022
- Volume:
- 41
- Issue:
- 6
- Issue Sort Value:
- 2022-0041-0006-0000
- Page Start:
- 1248
- Page End:
- 1313
- Publication Date:
- 2022-05-11
- Subjects:
- credit risk -- feature selection -- machine learning -- personal loan default prediction
Forecasting -- Periodicals
Forecasting -- Mathematical models -- Periodicals
003.2 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/for.2856 ↗
- Languages:
- English
- ISSNs:
- 0277-6693
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4984.577000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23006.xml