Predictive modelling of user engagement for subscription retention

Loading...
Thumbnail Image

URL

Journal Title

Journal ISSN

Volume Title

School of Science | Master's thesis

Department

Mcode

Language

en

Pages

41

Series

Abstract

The rise of digital news and media subscriptions has made subscription based business models increasingly important for news and media organisations such as Sanoma. In this context, understanding and predicting subscription churn of Helsingin Sanomat (main product of Sanoma Media Finland and one of the largest news provider in Finland), as well as identifying the key factors behind it, is essential for ensuring the long term sustainability of the digital media. This thesis investigates subscription retention and churn through predictive modelling, comparing a classical logistic regression model with a more advanced ensemble method, XGBoost. The main objective was not only to improve the accuracy of subscription churn prediction but also to interpret the driving factors behind the churn, thus providing insights for retention strategies. The analysis presents that XGBoost outperformed logistic regression by effectively capturing nonlinear relationships and interactions within the data. For model interpretation, feature importance analysis, gain and SHAP values were applied to the XGBoost model. The SHAP analysis indicated that customers with longer subscription histories and higher engagement activities were less likely to churn. However, these results should be interpreted as insights rather than causal conclusions.

Description

Supervisor

Vehtari, Aki

Thesis advisor

Rautio, Sini

Other note

Citation