Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending. (20th July 2022)
- Record Type:
- Journal Article
- Title:
- Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending. (20th July 2022)
- Main Title:
- Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending
- Authors:
- Xia, Yufei
Guo, Xinyi
Li, Yinguo
He, Lingyun
Chen, Xueyuan - Abstract:
- Abstract: Online consumer lending has recently been growing rapidly, but it faces high credit risk. For this problem, developing powerful credit scoring models has become an effective solution and can be achieved from three aspects: modeling approach, data source, and evaluation measure. This paper proposes a novel model that departs from those in previous studies in threefold. First, a heterogeneous deep forest model that combines deep learning architecture and tree‐based ensemble classifiers is proposed as the modeling approach. Second, a Bayesian‐based macroeconomic variable optimization method is developed to determine the macroeconomic variables and the corresponding lag term, and the selected macroeconomic variables are used as supplementary data source for modeling. Lastly, a series of capital charge error measures is proposed to evaluate credit scoring models from a regulatory perspective. The proposal is evaluated on multiple large datasets under performance measures on predictive accuracy, profitability, and capital charge errors. Frequentist and Bayesian nonparametric significance tests are used to examine the statistical significance of heterogeneous deep forest and benchmarks. Three main conclusions can be reached from the comparison. First, heterogeneous deep forest significantly outperforms the industry benchmarks over all the evaluation measures. Second, the predictive performance is enhanced after incorporating the selected macroeconomic variables and theAbstract: Online consumer lending has recently been growing rapidly, but it faces high credit risk. For this problem, developing powerful credit scoring models has become an effective solution and can be achieved from three aspects: modeling approach, data source, and evaluation measure. This paper proposes a novel model that departs from those in previous studies in threefold. First, a heterogeneous deep forest model that combines deep learning architecture and tree‐based ensemble classifiers is proposed as the modeling approach. Second, a Bayesian‐based macroeconomic variable optimization method is developed to determine the macroeconomic variables and the corresponding lag term, and the selected macroeconomic variables are used as supplementary data source for modeling. Lastly, a series of capital charge error measures is proposed to evaluate credit scoring models from a regulatory perspective. The proposal is evaluated on multiple large datasets under performance measures on predictive accuracy, profitability, and capital charge errors. Frequentist and Bayesian nonparametric significance tests are used to examine the statistical significance of heterogeneous deep forest and benchmarks. Three main conclusions can be reached from the comparison. First, heterogeneous deep forest significantly outperforms the industry benchmarks over all the evaluation measures. Second, the predictive performance is enhanced after incorporating the selected macroeconomic variables and the corresponding lag, and the result remains robust under cross‐validation and forward‐chaining validation. Third, the capital charge errors reflect the model performance from a regulatory perspective and thus lead to different rankings from those when evaluating predictive accuracy and profitability. … (more)
- Is Part Of:
- Journal of forecasting. Volume 41:Number 8(2022)
- Journal:
- Journal of forecasting
- Issue:
- Volume 41:Number 8(2022)
- Issue Display:
- Volume 41, Issue 8 (2022)
- Year:
- 2022
- Volume:
- 41
- Issue:
- 8
- Issue Sort Value:
- 2022-0041-0008-0000
- Page Start:
- 1669
- Page End:
- 1690
- Publication Date:
- 2022-07-20
- Subjects:
- credit scoring -- deep learning -- financial regulation -- gradient boosting decision tree -- macroeconomic variable -- online consumer lending
Forecasting -- Periodicals
Forecasting -- Mathematical models -- Periodicals
003.2 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/for.2891 ↗
- Languages:
- English
- ISSNs:
- 0277-6693
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4984.577000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 24237.xml