Churn Prediction System for Telecom using Filter–Wrapper and Ensemble Classification. (1st February 2016)
- Record Type:
- Journal Article
- Title:
- Churn Prediction System for Telecom using Filter–Wrapper and Ensemble Classification. (1st February 2016)
- Main Title:
- Churn Prediction System for Telecom using Filter–Wrapper and Ensemble Classification
- Authors:
- Idris, Adnan
Khan, Asifullah
- Abstract:
- Churn prediction in telecom is a challenging data mining task for retaining customers, especially, when we have imbalanced class distribution, high dimensionality and large number of samples in training set. To cope with this challenging task of churn prediction, we propose a new intelligent churn prediction system for telecom, named FW-ECP. The novelty of the FW-ECP lies in its ability to combine both filter- and wrapper-based feature selection as well as exploit the learning capability of an ensemble classifier built using diverse base classifiers. In the filter phase, Particle Swarm Optimization-based undersampling and mRMR feature selection are employed to reduce the effect of imbalanced class distribution and large dimensionality. In Wrapper phase, we employ Genetic Algorithm that further discards irrelevant and redundant features. Random Forest, Rotation Forest, RotBoost and SVMs are then employed to exploit the new feature space. Finally, the ensemble classifier is constructed using both majority voting and stacking. We have tested and compared the performance of proposed FW-ECP system on two publicly available standard telecom datasets: Orange and Cell2Cell. FW-ECP takes into account both the imbalanced nature and large dimensionality of the training sets and yields better prediction performances compared with existing state-of-the-art approaches. The feature spaces for the Orange and Cell2Cell datasets are reduced to 24D and 18D, from 260D and 76D, respectively. The AUCs obtained by FW-ECP are 0.85 and 0.82 for Orange and Cell2Cell datasets, respectively. …
- Is Part Of:
- Computer journal. Volume 60:Number 3(2017)
- Journal:
- Computer journal
- Issue:
- Volume 60:Number 3(2017)
- Issue Display:
- Volume 60, Issue 3 (2017)
- Year:
- 2017
- Volume:
- 60
- Issue:
- 3
- Issue Sort Value:
- 2017-0060-0003-0000
- Page Start:
- 410
- Page End:
- 430
- Publication Date:
- 2016-02-01
- Subjects:
- telecom churn prediction -- Filter–Wrapper -- particle swarm optimization -- minimum redundancy and maximum relevance -- Genetic Algorithm -- ensemble classification
Computers -- Periodicals
005.1
- Journal URLs:
- http://comjnl.oxfordjournals.org/
http://ukcatalogue.oup.com/
- DOI:
- 10.1093/comjnl/bxv123
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21744.xml
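
The abstract above says the final ensemble is built with majority voting (and stacking) over diverse base classifiers. As a rough illustration of the voting step only, here is a minimal sketch in plain Python. It is not the authors' implementation: the function name `majority_vote`, the sample predictions, and the tie-breaking rule (smallest label wins) are all assumptions made for this example.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-classifier label lists into one label per sample.

    `predictions[i][j]` is the label that classifier i assigns to sample j.
    """
    if not predictions:
        raise ValueError("need predictions from at least one classifier")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all classifiers must predict the same samples")

    combined = []
    # zip(*predictions) yields, for each sample, the votes of all classifiers.
    for votes in zip(*predictions):
        counts = Counter(votes)
        top = max(counts.values())
        # Tie-break (an assumption of this sketch): the smallest label wins.
        combined.append(min(label for label, c in counts.items() if c == top))
    return combined


# Hypothetical outputs of four base classifiers (1 = churn, 0 = no churn).
base_predictions = [
    [1, 0, 1, 0, 1],
    [1, 0, 0, 0, 1],
    [0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0],
]
ensemble_labels = majority_vote(base_predictions)
print(ensemble_labels)
```

Stacking, the other combination scheme the abstract names, would instead train a second-level learner on the base classifiers' outputs; that step is not shown here.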