Learning latent representations of bank customers with the Variational Autoencoder. (February 2021)
- Record Type:
- Journal Article
- Title:
- Learning latent representations of bank customers with the Variational Autoencoder. (February 2021)
- Main Title:
- Learning latent representations of bank customers with the Variational Autoencoder
- Authors:
- Mancisidor, Rogelio A.
Kampffmeyer, Michael
Aas, Kjersti
Jenssen, Robert - Abstract:
- Abstract: Learning data representations that reflect the customers' creditworthiness can improve marketing campaigns, customer relationship management, data and process management or the credit risk assessment in retail banks. In this research, we show that it is possible to steer data representations in the latent space of the Variational Autoencoder (VAE) using a semi-supervised learning framework and a specific grouping of the input data called Weight of Evidence (WoE). Our proposed method learns a latent representation of the data showing a well-defied clustering structure. The clustering structure captures the customers' creditworthiness, which is unknown a priori and cannot be identified in the input space. The main advantages of our proposed method are that it captures the natural clustering of the data, suggests the number of clusters, captures the spatial coherence of customers' creditworthiness, generates data representations of unseen customers and assign them to one of the existing clusters. Our empirical results, based on real data sets reflecting different market and economic conditions, show that none of the well-known data representation models in the benchmark analysis are able to obtain well-defined clustering structures like our proposed method. Further, we show how banks can use our proposed methodology to improve marketing campaigns and credit risk assessment. Highlights: It is possible to steer the latent space of the Variational Autoencoder. LatentAbstract: Learning data representations that reflect the customers' creditworthiness can improve marketing campaigns, customer relationship management, data and process management or the credit risk assessment in retail banks. In this research, we show that it is possible to steer data representations in the latent space of the Variational Autoencoder (VAE) using a semi-supervised learning framework and a specific grouping of the input data called Weight of Evidence (WoE). Our proposed method learns a latent representation of the data showing a well-defied clustering structure. The clustering structure captures the customers' creditworthiness, which is unknown a priori and cannot be identified in the input space. The main advantages of our proposed method are that it captures the natural clustering of the data, suggests the number of clusters, captures the spatial coherence of customers' creditworthiness, generates data representations of unseen customers and assign them to one of the existing clusters. Our empirical results, based on real data sets reflecting different market and economic conditions, show that none of the well-known data representation models in the benchmark analysis are able to obtain well-defined clustering structures like our proposed method. Further, we show how banks can use our proposed methodology to improve marketing campaigns and credit risk assessment. Highlights: It is possible to steer the latent space of the Variational Autoencoder. Latent representations that learn the customers' creditworthiness. Well-defined clustering structures with statistically different default probabilities. Our method enables visualization and suggests the number of clusters. The proposed methodology generalizes to unseen customers. … (more)
- Is Part Of:
- Expert systems with applications. Volume 164(2021)
- Journal:
- Expert systems with applications
- Issue:
- Volume 164(2021)
- Issue Display:
- Volume 164, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 164
- Issue:
- 2021
- Issue Sort Value:
- 2021-0164-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Variational Autoencoder -- Data representations -- Clustering -- Machine learning
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2020.114020 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14894.xml