Device Identification from Network Traffic Measurements - A HTTP User Agent Based Method
No Thumbnail Available
URL
Journal Title
Journal ISSN
Volume Title
Sähkötekniikan korkeakoulu |
Master's thesis
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Authors
Date
2012
Major/Subject
Tietoverkkotekniikka
Mcode
S-38
Degree programme
Language
en
Pages
[6] + 72 s.
Series
Abstract
The proliferation of mobile Internet in recent years has created an increasing need to understand the usage of the mobile services. With widespread adoption of the Internet capable mobile handheld devices, knowledge about mobile Internet usage is beneficial from different aspects of the stakeholders. A key challenge is that the factual information available on User Agent (UA) based device identification from IP traffic measurements is limited. The objective of our research is two folded; to develop a tool to identify mobile devices based on the HTTP UA obtained from the network traffic measurements and to profile mobile Internet usage in Finland. We observe that the tool can be developed by using a device description repository (DDR) and its API to interact with the repository and extract device related information. Moreover, the results from the DDR implementation can be improved by enhancing its original output. With the identification results, we provide descriptive statistics to aid in profiling the usage of the mobile Internet in Finnish mobile networks. Wireless Universal Resource File (WURFL) DDR based tool produced accurate identification of the devices from the HTTP UA strings. However, identification of the devices from the UA strings generated by the applications other than the web browsers required additional programming. The resulting enhanced WURFL tool was able to improve the device identification results roughly by 15% points with our dataset. Based on the assessment of the enhanced WURFL tool, we observe that roughly 94% of the total UA strings subjected to the analysis were identified correctly. The share of incorrectly identified UA strings was about 0.5%. The data analysis results indicate that the majority of mobile handset traffic is generated by handsets with advanced capabilities such as 3G and the touchscreen, manufactured by numerous brands of mobile devices with different operating systems. The results from the identification of these devices and device features could be utilized by the operators to support the pricing and business development.Description
Supervisor
Hämmäinen, HeikkiThesis advisor
Riikonen, AnttiKeywords
network traffic measurements, mobile internet, device identification, HTTP user agent, DDR, WURFL, Java API