Graph based clustering for anomaly detection in IP networks

No Thumbnail Available

URL

Journal Title

Journal ISSN

Volume Title

School of Science | Master's thesis
Checking the digitized thesis and permission for publishing
Instructions for the author

Date

2011

Major/Subject

Informaatiotekniikka

Mcode

T-61

Degree programme

Language

en

Pages

[11] + 67

Series

Abstract

In IP networks, an anomaly detection system identifies attacks, device failures or other unknown processes that deviate from the normal behaviour of the network known as anomalies. The thesis studied anomaly detection in traffic datasets from IP networks. The datasets contained high number of normal events and few anomalies. This resembles a normally operating network. We construct graphs from traffic data and study their properties. We formulated anomaly detection as a graph based clustering problem. A novel graph bi-partitioning algorithm called NodeClustering was designed to separate normal samples from anomalous ones. Performance of NodeClustering was investigated with extensive network traffic data. The performance was compared with state of the art graph based spectral clustering algorithms. NodeClustering identified all the known intrusions in the data and outperformed the compared graph based methods with an average improvement of 50% on the true positive rate with lowest false positive rate on the studied datasets. In addition, its applicability to one non-traffic dataset was shown. NodeClustering can be used in IP networks to detect anomalies. In the future, threshold used for graph partitioning can be studied further and computationally efficient < methods to construct larger graphs might be studied.

Description

Supervisor

Oja, Erkki

Thesis advisor

Hätönen, Kimmo

Keywords

anomaly detection, graphs, node degree, node clustering, weight matrix, partition, clustering

Other note

Citation