Inicio  /  Future Internet  /  Vol: 12 Par: 9 (2020)  /  Artículo
ARTÍCULO
TITULO

On Frequency Estimation and Detection of Heavy Hitters in Data Streams

Federica Ventruto    
Marco Pulimeno    
Massimo Cafaro and Italo Epicoco    

Resumen

A stream can be thought of as a very large set of data, sometimes even infinite, which arrives sequentially and must be processed without the possibility of being stored. In fact, the memory available to the algorithm is limited and it is not possible to store the whole stream of data which is instead scanned upon arrival and summarized through a succinct data structure in order to maintain only the information of interest. Two of the main tasks related to data stream processing are frequency estimation and heavy hitter detection. The frequency estimation problem requires estimating the frequency of each item, that is the number of times or the weight with which each appears in the stream, while heavy hitter detection means the detection of all those items with a frequency higher than a fixed threshold. In this work we design and analyze ACMSS, an algorithm for frequency estimation and heavy hitter detection, and compare it against the state of the art ASketch algorithm. We show that, given the same budgeted amount of memory, for the task of frequency estimation our algorithm outperforms ASketch with regard to accuracy. Furthermore, we show that, under the assumptions stated by its authors, ASketch may not be able to report all of the heavy hitters whilst ACMSS will provide with high probability the full list of heavy hitters.

 Artículos similares

       
 
Chi Zhang, Zhong Yang, Haoze Zhuo, Luwei Liao, Xin Yang, Tang Zhu and Guotao Li    
Self-localization and state estimation are crucial capabilities for agile drone autonomous navigation. This article presents a lightweight and drift-free vision-IMU-GNSS tightly coupled multisensor fusion (LDMF) strategy for drones? autonomous and safe n... ver más
Revista: Drones

 
Hadis Pakdel, Dev Raj Paudyal, Sreeni Chadalavada, Md Jahangir Alam and Majid Vazifedoust    
The frequency and severity of extremes, including extreme precipitation events, extreme evapotranspiration and extreme water storage deficit events, are changing. Thus, the necessity for developing a framework that estimates non-stationary conditions is ... ver más

 
Minerva Singh and Xin Cai    
Coastal flooding has been a significant hazard in Hong Kong. Influenced by climate change, extreme coastal flooding events have been frequently observed in the past decades. Nowadays, the real estate sector has increasingly recognized the significance of... ver más

 
Faris Tre?njo, Mustafa Humo, Filippo Casarin and Naida Ademovic    
Minarets, tall structures, connected or not to the mosque attract attention due to their specific architectural features. Vulnerability to seismic damage has been witnessed throughout history on tall and slender structures after earthquake ground motions... ver más
Revista: Buildings

 
Sul-Min Yun, Ji-Hye Jeong, Hang-Tak Jeon, Jae-Yeol Cheong and Se-Yeong Hamm    
Groundwater droughts are one of the natural disasters that raise serious water issues for humans, and are increasing in frequency due to global climate change. In order to identify groundwater droughts, we recorded groundwater level fluctuations upstream... ver más
Revista: Water