Inicio  /  Informatics  /  Vol: 6 Par: 1 (2019)  /  Artículo
ARTÍCULO
TITULO

Selective Wander Join: Fast Progressive Visualizations for Data Joins

Marianne Procopio    
Carlos Scheidegger    
Eugene Wu and Remco Chang    

Resumen

Progressive visualization offers a great deal of promise for big data visualization; however, current progressive visualization systems do not allow for continuous interaction. What if users want to see more confident results on a subset of the visualization? This can happen when users are in exploratory analysis mode but want to ask some directed questions of the data as well. In a progressive visualization system, the online aggregation algorithm determines the database sampling rate and resulting convergence rate, not the user. In this paper, we extend a recent method in online aggregation, called Wander Join, that is optimized for queries that join tables, one of the most computationally expensive operations. This extension leverages importance sampling to enable user-driven sampling when data joins are in the query. We applied user interaction techniques that allow the user to view and adjust the convergence rate, providing more transparency and control over the online aggregation process. By leveraging importance sampling, our extension of Wander Join also allows for stratified sampling of groups when there is data distribution skew. We also improve the convergence rate of filtering queries, but with additional overhead costs not needed in the original Wander Join algorithm.

 Artículos similares

       
 
Junlin Lou, Burak Yuksek, Gokhan Inalhan and Antonios Tsourdos    
In this study, we consider the problem of motion planning for urban air mobility applications to generate a minimal snap trajectory and trajectory that cost minimal time to reach a goal location in the presence of dynamic geo-fences and uncertainties in ... ver más
Revista: Aerospace

 
Thomas Parr, Karl Friston and Peter Zeidman    
Bayesian inference typically focuses upon two issues. The first is estimating the parameters of some model from data, and the second is quantifying the evidence for alternative hypotheses?formulated as alternative models. This paper focuses upon a third ... ver más
Revista: Algorithms

 
Swagat Bhattacharyya and Jennifer O. Hasler    
While wireless sensor node (WSNs) have proliferated with the rise of the Internet of Things (IoT), uniformly sampled analog?digital converters (ADCs) have traditionally reigned paramount in the signal processing pipeline. The large volume of data generat... ver más

 
Firas Alghanim, Ibrahim Al-Hurani, Hazem Qattous, Abdullah Al-Refai, Osamah Batiha, Abedalrhman Alkhateeb and Salama Ikki    
Identifying menopause-related breast cancer biomarkers is crucial for enhancing diagnosis, prognosis, and personalized treatment at that stage of the patient?s life. In this paper, we present a comprehensive framework for extracting multiomics biomarkers... ver más
Revista: Algorithms

 
Jinjia Zhou and Jian Yang    
Compressive Sensing (CS) has emerged as a transformative technique in image compression, offering innovative solutions to challenges in efficient signal representation and acquisition. This paper provides a comprehensive exploration of the key components... ver más
Revista: Information