Unveiling the Power of Hybrid Balancing Techniques and Ensemble Stacked and Blended Classifiers for Enhanced Churn Prediction
Conference paper
Gaikwad, K., Berardinelli, N. and Qazi, N. 2024. Unveiling the Power of Hybrid Balancing Techniques and Ensemble Stacked and Blended Classifiers for Enhanced Churn Prediction. 16th Asian Conference on Intelligent Information and Database Systems. UAE 15 Apr 2024 - 18 Jun 2025 Springer. https://doi.org/10.1007/978-981-97-5937-8_20
Authors | Gaikwad, K., Berardinelli, N. and Qazi, N. |
---|---|
Type | Conference paper |
Abstract | For businesses, customer retention is crucial as it is more cost-effective than acquiring new customers. Identifying potential customer churn early allows for the development of effective retention strategies. With advancements in technology and data storage, machine learning has become a popular approach for predicting customer churn. To counteract data imbalance, researchers have utilized minority oversampling methods, particularly the Synthetic Minority Over-sampling Technique (SMOTE). Innovations in this area include hybrid techniques like SMOTE Tomek-Links and SMOTE ENN, which have shown effectiveness in data resampling. Traditional classifiers like Logistic Regression, Naïve Bayes, Support Vector Machine, and K-Nearest Neighbors have been surpassed in performance by ensemble classifiers such as XgBoost, LightGBM, and CatBoost. Yet, there is limited research on the combination of SMOTE hybrid techniques with these advanced ensemble classifiers for churn prediction. This study aims to contribute to the field by integrating hybrid balancing techniques with ensemble classifiers and introducing new stacked and blended models. The findings reveal that a stacked model incorporating SMOTE ENN achieved impressive results: 96.46% accuracy, 97% F1 score, and 97.40% PR-AUC. This was closely followed by the CatBoost-SMOTE ENN model, which scored 95.32% in accuracy, 96% F1 score, and 96.50% PR-AUC. In contrast, ADASYN and standard SMOTE techniques did not significantly affect model performance. |
Year | 2024 |
Conference | 16th Asian Conference on Intelligent Information and Database Systems |
Publisher | Springer |
Accepted author manuscript | License File Access Level Anyone |
Publication dates | |
Online | 13 Aug 2024 |
Publication process dates | |
Deposited | 13 Jun 2025 |
Journal | Communications in Computer and Information Science |
Journal citation | 2144, p. 238–249 |
ISSN | 0302-9743 |
1611-3349 | |
Book title | Intelligent Information and Database Systems: 16th Asian Conference, ACIIDS 2024, Ras Al Khaimah, UAE, April 15–18, 2024, Proceedings, Part I |
Book editor | Nguyen, N. T. |
Chbeir, R. | |
Manolopoulos, Y. | |
Fujita, H. | |
Hong, T-P. | |
Nguyen, L. M. | |
Wojtkiewicz, K. | |
ISBN | 978-981-97-4981-2 |
978-981-97-4982-9 | |
Digital Object Identifier (DOI) | https://doi.org/10.1007/978-981-97-5937-8_20 |
Web address (URL) of conference proceedings | https://link.springer.com/book/10.1007/978-981-97-4982-9 |
Copyright holder | © 2024 The Authors |
https://repository.uel.ac.uk/item/8zqy4
Restricted files
Accepted author manuscript
5
total views1
total downloads5
views this month1
downloads this month