Graph based clustering for anomaly detection in IP networks
No Thumbnail Available
Journal Title
Journal ISSN
Volume Title
School of Science |
Master's thesis
Checking the digitized thesis and permission for publishing
Instructions for the author
Instructions for the author
Author
Date
2011
Department
Major/Subject
Informaatiotekniikka
Mcode
T-61
Degree programme
Language
en
Pages
[11] + 67
Series
Abstract
In IP networks, an anomaly detection system identifies attacks, device failures or other unknown processes that deviate from the normal behaviour of the network known as anomalies. The thesis studied anomaly detection in traffic datasets from IP networks. The datasets contained high number of normal events and few anomalies. This resembles a normally operating network. We construct graphs from traffic data and study their properties. We formulated anomaly detection as a graph based clustering problem. A novel graph bi-partitioning algorithm called NodeClustering was designed to separate normal samples from anomalous ones. Performance of NodeClustering was investigated with extensive network traffic data. The performance was compared with state of the art graph based spectral clustering algorithms. NodeClustering identified all the known intrusions in the data and outperformed the compared graph based methods with an average improvement of 50% on the true positive rate with lowest false positive rate on the studied datasets. In addition, its applicability to one non-traffic dataset was shown. NodeClustering can be used in IP networks to detect anomalies. In the future, threshold used for graph partitioning can be studied further and computationally efficient < methods to construct larger graphs might be studied.Description
Supervisor
Oja, ErkkiThesis advisor
Hätönen, KimmoKeywords
anomaly detection, graphs, node degree, node clustering, weight matrix, partition, clustering