Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods. (June 2019)
- Record Type:
- Journal Article
- Title:
- Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods. (June 2019)
- Main Title:
- Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods
- Authors:
- Amin, Adnan
Shah, Babar
Khattak, Asad Masood
Lopes Moreira, Fernando Joaquim
Ali, Gohar
Rocha, Alvaro
Anwar, Sajid - Abstract:
- Highlights: Devised a model for Cross-Company Churn Prediction (CCCP) in telecommunication sector. Comprehensively explored the impact of the data transformation methods (i.e., Log, Rank, Box-Cox, and Z-Score) on CCCP model performance. Investigated the performance of multiple state-of-the-art classifiers (i.e., Naïve Bayes, K-nearest neighbour, Gradient Boosted Tree, Single Rule Induction, and Deep Learner Neural Net). Empirically addressed the following research questions: RQ1: What is the effect of DT methods (i.e., log, Rank, Box-Cox and Z-Score) on data normality in CCCP? RQ2: What impact does the DT method has in the performance of different classifiers? RQ3: Do the application of different DT methods exhibits significant performance difference? Abstract: Cross-Company Churn Prediction (CCCP) is a domain of research where one company (target) is lacking enough data and can use data from another company (source) to predict customer churn successfully. To support CCCP, the cross-company data is usually transformed to a set of similar normal distribution of target company data prior to building a CCCP model. However, it is still unclear which data transformation method is most effective in CCCP. Also, the impact of data transformation methods on CCCP model performance using different classifiers have not been comprehensively explored in the telecommunication sector. In this study, we devised a model for CCCP using data transformation methods (i.e., log, z-score, rank andHighlights: Devised a model for Cross-Company Churn Prediction (CCCP) in telecommunication sector. Comprehensively explored the impact of the data transformation methods (i.e., Log, Rank, Box-Cox, and Z-Score) on CCCP model performance. Investigated the performance of multiple state-of-the-art classifiers (i.e., Naïve Bayes, K-nearest neighbour, Gradient Boosted Tree, Single Rule Induction, and Deep Learner Neural Net). Empirically addressed the following research questions: RQ1: What is the effect of DT methods (i.e., log, Rank, Box-Cox and Z-Score) on data normality in CCCP? RQ2: What impact does the DT method has in the performance of different classifiers? RQ3: Do the application of different DT methods exhibits significant performance difference? Abstract: Cross-Company Churn Prediction (CCCP) is a domain of research where one company (target) is lacking enough data and can use data from another company (source) to predict customer churn successfully. To support CCCP, the cross-company data is usually transformed to a set of similar normal distribution of target company data prior to building a CCCP model. However, it is still unclear which data transformation method is most effective in CCCP. Also, the impact of data transformation methods on CCCP model performance using different classifiers have not been comprehensively explored in the telecommunication sector. In this study, we devised a model for CCCP using data transformation methods (i.e., log, z-score, rank and box-cox) and presented not only an extensive comparison to validate the impact of these transformation methods in CCCP, but also evaluated the performance of underlying baseline classifiers (i.e., Naive Bayes (NB), K-Nearest Neighbour (KNN), Gradient Boosted Tree (GBT), Single Rule Induction (SRI) and Deep learner Neural net (DP)) for customer churn prediction in telecommunication sector using the above mentioned data transformation methods. We performed experiments on publicly available datasets related to the telecommunication sector. The results demonstrated that most of the data transformation methods (e.g., log, rank, and box-cox) improve the performance of CCCP significantly. However, the Z-Score data transformation method could not achieve better results as compared to the rest of the data transformation methods in this study. Moreover, it is also investigated that the CCCP model based on NB outperform on transformed data and DP, KNN and GBT performed on the average, while SRI classifier did not show significant results in term of the commonly used evaluation measures (i.e., probability of detection, probability of false alarm, area under the curve and g-mean). … (more)
- Is Part Of:
- International journal of information management. Volume 46(2019)
- Journal:
- International journal of information management
- Issue:
- Volume 46(2019)
- Issue Display:
- Volume 46, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 46
- Issue:
- 2019
- Issue Sort Value:
- 2019-0046-2019-0000
- Page Start:
- 304
- Page End:
- 319
- Publication Date:
- 2019-06
- Subjects:
- Churn prediction -- Cross-company -- Data transformation -- Box-cox -- Rank -- Log -- Z-Score
Social sciences -- Information services -- Periodicals
Social sciences -- Research -- Periodicals
Information science -- Periodicals
Management information systems -- Periodicals
Knowledge management -- Periodicals
Sciences sociales -- Documentation, Services de -- Périodiques
Sciences sociales -- Recherche -- Périodiques
Sciences de l'information -- Périodiques
Systèmes d'information de gestion -- Périodiques
Information science
Management information systems
Social sciences -- Information services
Social sciences -- Research
Periodicals
Electronic journals
025.52068 - Journal URLs:
- http://www.sciencedirect.com/science/journal/02684012 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijinfomgt.2018.08.015 ↗
- Languages:
- English
- ISSNs:
- 0268-4012
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.304900
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9731.xml