SuperSketch: A Multi-Dimensional Reversible Data Structure for Super Host Identification

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Major/Subject

Mcode

Degree programme

Language

en

Pages

14

Series

IEEE Transactions on Dependable and Secure Computing, Volume 19, issue 4, pp. 2741-2754

Abstract

Facing big network traffic data, effective data compression becomes crucially important and urgently needed for estimating host cardinalities and identifying super hosts. However, the current literature confronts several challenges: incapability of simultaneously measuring various types of host cardinalities and inability to efficiently reconstruct super host addresses. To address these challenges, in this paper, we propose a novel sketch data structure, named SuperSketch, to simultaneously measure multiple types of host cardinalities with the purpose of efficiently identifying super hosts. SuperSketch has two significant characteristics: multi-dimensionality and reversibility. The multi-dimensionality makes SuperSketch capable of simultaneously measuring Source Cardinality, Destination Cardinality and Destination Port Cardinality. The reversibility allows SuperSketch to accurately and quickly reconstruct the original addresses of super hosts once they are identified. We conduct both theoretical analysis and performance evaluation based on real-world network traffic. Experimental results show that SuperSketch achieves outstanding performance for multi-cardinality measurement, super host identification and host address reconstruction.

Description

Talleta OA-artikkeli, kun julkaistu.

Other note

Citation

Jing, X, Han, H, Yan, Z & Pedrycz, W 2022, 'SuperSketch: A Multi-Dimensional Reversible Data Structure for Super Host Identification', IEEE Transactions on Dependable and Secure Computing, vol. 19, no. 4, pp. 2741-2754. https://doi.org/10.1109/TDSC.2021.3072295