Predictive modelling of user engagement for subscription retention

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorRautio, Sini
dc.contributor.authorJayanegara, Angeline
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorVehtari, Aki
dc.date.accessioned2025-08-19T17:28:53Z
dc.date.available2025-08-19T17:28:53Z
dc.date.issued2025-07-31
dc.description.abstractThe rise of digital news and media subscriptions has made subscription based business models increasingly important for news and media organisations such as Sanoma. In this context, understanding and predicting subscription churn of Helsingin Sanomat (main product of Sanoma Media Finland and one of the largest news provider in Finland), as well as identifying the key factors behind it, is essential for ensuring the long term sustainability of the digital media. This thesis investigates subscription retention and churn through predictive modelling, comparing a classical logistic regression model with a more advanced ensemble method, XGBoost. The main objective was not only to improve the accuracy of subscription churn prediction but also to interpret the driving factors behind the churn, thus providing insights for retention strategies. The analysis presents that XGBoost outperformed logistic regression by effectively capturing nonlinear relationships and interactions within the data. For model interpretation, feature importance analysis, gain and SHAP values were applied to the XGBoost model. The SHAP analysis indicated that customers with longer subscription histories and higher engagement activities were less likely to churn. However, these results should be interpreted as insights rather than causal conclusions.en
dc.format.extent41
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/138221
dc.identifier.urnURN:NBN:fi:aalto-202508196451
dc.language.isoenen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programme.majorMachine Learning, Data Science and Artificial Intelligenceen
dc.subject.keywordmachine learningen
dc.subject.keyworddata scienceen
dc.subject.keywordlogistic regressionen
dc.subject.keywordXGBoosten
dc.subject.keywordsubscription retentionen
dc.subject.keywordfeature importance analysisen
dc.titlePredictive modelling of user engagement for subscription retentionen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Jayanegara_Angeline_2025.pdf
Size:
1.38 MB
Format:
Adobe Portable Document Format