Clustering and predicting the data usage patterns of geographically diverse mobile users

No Thumbnail Available
Access rights
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
Degree programme
Computer Networks
Mobile users demand more and more data traffic, yet network resources are limited. This creates a challenge for network resource management. One way of addressing this challenge is by understanding the data usage patterns of mobile users so that resources can be optimally allocated based on user traffic demand and data usage behavior. However, understanding and characterizing the data usage patterns of mobile users is a complex task. In this work, we investigate and characterize users’ data usage patterns and behavior in mobile networks. We leverage a dataset (∼113 million records) collected through a crowd-based mobile network measurement platform – Netradar – across five countries. Data usage behavior of users over a cellular network is primarily driven by user mobility, the type of subscription plan marketed by Mobile Network Operators (MNOs), network congestion, and network coverage. We apply an unsupervised machine learning approach to cluster mobile user types by considering different factors such as data consumption, network access type, the number of sessions created per user, throughput, and mobility. By defining data usage pattern of mobile users, we develop a user clustering model and identify three different mobile user groups (clusters). Our clustering model shows that the data usage patterns are unevenly distributed across the five countries studied, characterized by a small number of heavy users consuming the highest volume of data. We show how the types of applications installed by users correlate with data consumption patterns in some countries. Heavy users tend to install more traffic-demanding apps than users from the other two groups – regular and light users. Finally, we trained a classification model using the labeled dataset produced by our aforementioned user clustering method. The model helps classifying mobile users according to their usage patterns (i.e., heavy, regular, and light) with an accuracy of ∼80% in the test dataset.
Mobile networks, Data usage patterns, User behavior modeling, Clustering data usage
Other note
Walelgne , E , Asrese , A , Manner , J , Bajpai , V & Ott , J 2021 , ' Clustering and predicting the data usage patterns of geographically diverse mobile users ' , Computer Networks , vol. 187 , 107737 .