Redirigiendo al acceso original de articulo en 21 segundos...
ARTÍCULO
TITULO

Boosting Computational Effectiveness in Big Spatial Flow Data Analysis with Intelligent Data Reduction

Ran Tao    
Zhaoya Gong    
Qiwei Ma and Jean-Claude Thill    

Resumen

One of the enduring issues of spatial origin-destination (OD) flow data analysis is the computational inefficiency or even the impossibility to handle large datasets. Despite the recent advancements in high performance computing (HPC) and the ready availability of powerful computing infrastructure, we argue that the best solutions are based on a thorough understanding of the fundamental properties of the data. This paper focuses on overcoming the computational challenge through data reduction that intelligently takes advantage of the heavy-tailed distributional property of most flow datasets. We specifically propose the classification technique of head/tail breaks to this end. We test this approach with representative algorithms from three common method families, namely flowAMOEBA from flow clustering, Louvain from network community detection, and PageRank from network centrality algorithms. A variety of flow datasets are adopted for the experiments, including inter-city travel flows, cellphone call flows, and synthetic flows. We propose a standard evaluation framework to evaluate the applicability of not only the selected three algorithms, but any given method in a systematic way. The results prove that head/tail breaks can significantly improve the computational capability and efficiency of flow data analyses while preserving result quality, on condition that the analysis emphasizes the ?head? part of the dataset or the flows with high absolute values. We recommend considering this easy-to-implement data reduction technique before analyzing a large flow dataset.

 Artículos similares

       
 
Ming-Yen Lin, Ping-Chun Wu and Sue-Chen Hsueh    
This study introduces session-aware recommendation models, leveraging GRU (Gated Recurrent Unit) and attention mechanisms for advanced latent interaction data integration. A primary advancement is enhancing latent context, a critical factor for boosting ... ver más
Revista: Future Internet

 
Zijia Zheng, Yizhu Jiang, Qiutong Zhang, Yanling Zhong and Lizheng Wang    
The timely monitoring of urban water bodies using unmanned aerial vehicle (UAV)-mounted remote sensing technology is crucial for urban water resource protection and management. Addressing the limitations of the use of satellite data in inferring the wate... ver más
Revista: Water

 
Hatef Dastour and Quazi K. Hassan    
Having a complete hydrological time series is crucial for water-resources management and modeling. However, this can pose a challenge in data-scarce environments where data gaps are widespread. In such situations, recurring data gaps can lead to unfavora... ver más
Revista: Hydrology

 
Aibing Jin, Prabhat Basnet and Shakil Mahtab    
In deep engineering, rockburst hazards frequently result in injuries, fatalities, and the destruction of contiguous structures. Due to the complex nature of rockbursts, predicting the severity of rockburst damage (intensity) without the aid of computer m... ver más

 
Sanguk Park    
This study aims to enable cost-effective Internet of Things (IoT) system design by removing redundant IoT sensors through the correlation analysis of sensing data collected in a smart home environment. This study also presents a data analysis and predict... ver más
Revista: Buildings