International Journal of Information Engineering and Electronic Business(IJIEEB)

ISSN: 2074-9023 (Print), ISSN: 2074-9031 (Online)

Published By: MECS Press

IJIEEB Vol.12, No.4, Aug. 2020

Designing a Real-Time Data-Driven Customer Churn Risk Indicator for Subscription Commerce

Full Text (PDF, 487KB), PP.1-14

Views:119   Downloads:6


Alexandros Deligiannis, Charalampos Argyriou

Index Terms

Churn prediction, Customer relationship management, Prototype algorithm, Purchase transaction data, Conversion rate.


One of the main goals of customer relationship management is to reduce or eliminate “customer churn”, i.e. loss of existing customers. This paper introduces a prototype algorithm to estimate a continuously updated indicator of the probability of an existing customer to cease purchasing from a subscription commerce business. The investigation is focused on the case of repeat consumers of subscription commerce products which require regular replacement or replenishment. The motivation is to help marketers to make targeted proactive retention actions by categorizing regular customers into groups of similar estimated churn risk. The proposed algorithm re-computes the probability of churn for each customer at regular intervals using past purchase transaction data and incorporating subscription-based business logic. We describe the detailed process from data collection and feature engineering to the algorithm’s design. We also present evaluation results of the algorithm’s performance based on a pilot test that took place on a consumables e-commerce business. The results suggest a significant capability of the proposed algorithm in capturing the purchasing intentions of repeat customers, regardless of the risk group they belong to.

Cite This Paper

Alexandros Deligiannis, Charalampos Argyriou, " Designing a Real-Time Data-Driven Customer Churn Risk Indicator for Subscription Commerce", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.12, No.4, pp. 1-14, 2020. DOI: 10.5815/ijieeb.2020.04.01


[1]Huang, B., Kechadi, M. and Buckley, B. (2012). Customer churn prediction in telecommunications. Expert Systems with Applications, 39(1), pp.1414-1425.

[2]Neslin, S. A., Gupta, S., Kamakura, W., Lu, J., & Mason, C. H. (2006). Defection detection: Measuring and understanding the predictive accuracy of customer churn models. Journal of marketing research, 43(2), 204-211.

[3]Bischof, S. F., Boettger, T. M., & Rudolph, T. (2019). Curated subscription commerce: A theoretical conceptualization. Journal of Retailing and Consumer Services, 101822.

[4]Van den Poel, D. and Larivière, B. (2004). Customer attrition analysis for financial servicesusing proportional hazard models. European Journal of Operational Research, 157(1),pp.196-217.

[5]Coussement, K. and Poel, D. (2009). Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers.Expert Systems with Applications, 36(3), pp. 6127-6134.

[6]Dalvi, P. K., Khandge, S. K., Deomore, A., Bankar, A., & Kanade, V. A. (2016, March). Analysis of customer churn prediction in telecom industry using decision trees and logistic regression. In 2016 Symposium on Colossal Data Analysis and Networking (CDAN) (pp. 1-4). IEEE.

[7]Hadden, J., Tiwari, A., Roy, R., & Ruta, D. (2007). Computer assisted customer churn management: State-of-the-art and future trends. Computers & Operations Research, 34(10), 2902-2917.

[8]Baumann, A., Lessmann, S., Coussement, K., & De Bock, K. W. (2015). Maximize What Matters: Predicting Customer Churn With Decision-Centric Ensemble Selection. In ECIS.

[9]Siber, R. (1997). Combating the churn phenomenon. Telecommunications, 31(10), 77–81.

[10]Deligiannis, A., Argyriou, C. & Kourtesis, D. (2019). Predictive personalization of conversational customer communications with data protection by design. IEEE/WIC/ACM International Conference on Web Intelligence on - WI ’19 Companion.

[11]Deligiannis, A., Argyriou, C., & Kourtesis, D. (2020). Building a Cloud-based Regression Model to Predict Click-through Rate in Business Messaging Campaigns. International Journal of Modeling and Optimization, 10(1), 26-31. doi:10.7763/IJMO.2020.V10.742

[12]Gordini, N. and Veglio, V. (2017). Customers churn prediction and marketing retention strategies. An application of support vector machines based on the AUC parameter-selection technique in B2B e-commerce industry. Industrial Marketing Management, 62, pp. 100-107.

[13]Vafeiadis, T., Diamantaras, K. I., Sarigiannidis, G., & Chatzisavvas, K. C. (2015). A comparison of machine learning techniques for customer churn prediction. Simulation Modelling Practice and Theory, 55, 1-9.

[14]Ahmad, A. K., Jafar, A., & Aljoumaa, K. (2019). Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 6(1), 28. 

[15]Wu, X., & Meng, S. (2016). E-commerce customer churn prediction based on improved SMOTE and AdaBoost. In 2016 13th International Conference on Service Systems and Service Management (ICSSSM) (pp. 1-5). IEEE.

[16]Berman, B. (2016). Referral marketing: Harnessing the power of your customers. Business Horizons, 59(1), pp. 19-28.

[17]Zhuang, Y. Y. (2018). Research on E-commerce Customer Churn Prediction Based on Improved Value Model and XG-Boost Algorithm. Management Science and Engineering, 12 (3), 51-56.

[18]Ascarza, E. (2018). Retention futility: Targeting high-risk customers might be ineffective. Journal of Marketing Research, 55(1), 80-98. 

[19]Cao, L. (2010). In-depth behavior understanding and use: The behavior informatics approach. Information Sciences, 180(17), pp.3067-3085.

[20]Wang, C. (2010). Apply robust segmentation to the service industry using kernel induced fuzzy clustering techniques. Expert Systems with Applications, 37(12), pp.8395-8400.

[21]Kaya, E., Dong, X., Suhara, Y., Balcisoy, S., Bozkaya, B. and Pentland, A. (2018). Behavioral attributes and financial churn prediction. EPJ Data Science, 7(1).

[22]Keramati, A. and Ardabili, S. (2011). Churn analysis for an Iranian mobile operator. Telecommunications Policy, 35(4), pp.344-356.

[23]Bose, I., & Chen, X. (2009). Hybrid models using unsupervised clustering for prediction of customer churn. Journal of Organizational Computing and Electronic Commerce, 19(2), 133-151.

[24]Euler, T. (2005). Churn prediction in telecommunications using mining mart. Proceedings of the Workshop on Data Mining and Business (DMBiz) at the 9th European Conference on Principles and Practice in Knowledge Discovery in Databases (PKDD).

[25]Coussement, K. and Van den Poel, D. (2008). Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques. Expert Systems with Applications, 34(1), pp.313-327.

[26]J. Hadden, A. Tiwari, R. Roy, and D. Ruta. (2006). Churn prediction using complaints data. Proceedings of world academy of science, engineering, and technology, 13:158–163.

[27]Romov, P. and Sokolov, E. (2015). RecSys Challenge 2015: Ensemble Learning with Categorical Features. In Proceedings of the 2015 International ACM Recommender Systems Challenge (RecSys ’15 Challenge). ACM, New York, NY, USA, Article 1, 4 pages.

[28]Sheil, H., Rana, O., & Reilly, R.G. (2018). Predicting Purchasing Intent: Automatic Feature Learning using Recurrent Neural Networks. ArXiv, abs/1807.08207.

[29]Toth, A., Tan L., Fabbrizio G., and Datta, A.. (2017). Predicting Shopping Behavior with Mixture of RNNs. In ACM SIGIR Forum. ACM.

[30]Wu C. Y., Ahmed, A., Beutel, A., Smola, A. J., and HowJing. (2017). Recurrent Recommender Networks. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining (WSDM ’17). ACM, New York, NY, USA, 495–503.

[31]D’Angelo, G., Rampone, S., & Palmieri, F. (2017). Developing a trust model for pervasive computing based on Apriori association rules learning and Bayesian classification. Soft Computing, 21(21), 6297-6315.

[32]Chiang D, Wang Y, Lee S, Lin C (2003) Goal-oriented sequential pattern for network banking and churn analysis. Expert Systems with Applications 25(3): 293–302.

[33]Verbeke, W., Martens, D., Mues, C., & Baesens, B. (2011). Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert systems with applications, 38(3), 2354-2364.

[34]Huang Y, Huang B, Kechadi MT. (2011). A rule-based method for customer churn prediction in telecommunication services. Advances in knowledge discovery and data mining. Berlin Heidelberg: Springer. pp. 411–22.

[35]Zhang, Y., Wang, Y., He, C. and Yang, T. (2014). Modeling and Application Research on Customer Churn Warning System Based in Big Data Era. International Journal of Multimedia and Ubiquitous Engineering, 9(9), pp. 281-298.

[36]Milošević, M., Živić, N., & Andjelković, I. (2017). Early churn prediction with personalized targeting in mobile social games. Expert Systems with Applications, 83, 326-332. 

[37]Deligiannis, A., Argyriou, C. and Kourtesis, D. (2020). Predicting the Optimal Date and Time to Send Personalized Marketing Messages to Repeat Buyers. International Journal of Advanced Computer Science and Applications, 11(4).

[38]Naik, N. (2017). Choice of effective messaging protocols for IoT systems: MQTT, CoAP, AMQP and HTTP. 2017 IEEE International Systems Engineering Symposium (ISSE).

[39]Dobbelaere, Philippe & Sheykh Esmaili, Kyumars. (2017). Kafka versus RabbitMQ.

[40]Lachaud, E. (2019). Adhering to GDPR Codes of Conduct: A Possible Option for SMEs to GDPR Certification. SSRN Electronic Journal.

[41]Botchkarev, A. (2019). A New Typology Design of Performance Metrics to Measure Errors in Machine Learning Regression Algorithms. Interdisciplinary Journal of Information, Knowledge, and Management, 14, pp.045-076.

[42]Willmott, C. and Matsuura, K. (2005). Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Climate Research, 30, pp.79-82.

[43]Alexander, D., Tropsha, A. and Winkler, D. (2015). Beware of R2: Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models. Journal of Chemical Information and Modeling, 55(7), pp.1316-1322.

[44]Singh, P., Anand, S. and B., S. (2017). Big Data Analysis with Apache Spark. International Journal of Computer Applications, 175(5), pp.6-8.

[45]Olasehinde, O., Johnson, O. and Fakoya, J. (2018). Computational Efficiency Analysis of Customer Churn Prediction Using Spark and Caret Random Forest Classifier. Information and Knowledge Management. Department of Computer Science, The Federal Polytechnic, Ile-Oluji, Ondo State, Nigeria, 8(2), pp.8-16.

[46]Bozhinov, I. (2019). AI and big data on IBM Power Systems servers. U.S.A.: IBM Corporation.

[47]Palmer, T. (2019). Predict and Optimize Business Outcomes with IBM Decision Optimization for Watson Studio and IBM Cloud Pak for Data. The Enterprise Strategy Group, Inc., pp.3-14.

[48]Dhoolia, P., Chugh, P., Costa, P., Gantayat, N., Gupta, M., Kambhatla, N., Kumar, R., Mani,S., Mitra, P., Rogerson, C. and Saxena, M. (2017). A cognitive system for business andtechnical support: A case study. IBM Journal of Research and Development, 61(1), pp.7:74-7:85.