概括性
计算机科学
吞吐量
计算
散列函数
素描
流式处理
实时计算
数据挖掘
并行计算
算法
程序设计语言
心理学
电信
心理治疗师
无线
作者
Zhong Yi,Xiang Li,Boyu Tian,Zhiyu Mei,Wenfei Wu
标识
DOI:10.1145/3447548.3467353
摘要
Data stream processing is a crucial computation task in data mining applications. The rigid and fixed data structures in existing solutions limit their accuracy, throughput, and generality in measurement tasks. We propose Dynamic Hierarchical Sketch (DHS), a sketch-based hybrid solution targeting these properties. During the online stream processing, DHS hashes items to buckets and organizes cells in each bucket dynamically; the size of all cells in a bucket is adjusted adaptively to the actual size and distribution of flows. Thus, memory is efficiently used to precisely record elephant flows and cover more mice flows. Implementation and evaluation show that DHS achieves high accuracy, high throughput, and high generality on five measurement tasks: flow size estimation, flow size distribution estimation, heavy hitter detection, heavy changer detection, and entropy estimation.
科研通智能强力驱动
Strongly Powered by AbleSci AI