Inicio  /  Algorithms  /  Vol: 12 Par: 2 (2019)  /  Artículo
ARTÍCULO
TITULO

A Distributed Execution Pipeline for Clustering Trajectories Based on a Fuzzy Similarity Relation

Soufiane Maguerra    
Azedine Boulmakoul    
Lamia Karim and Hassan Badir    

Resumen

The proliferation of indoor and outdoor tracking devices has led to a vast amount of spatial data. Each object can be described by several trajectories that, once analysed, can yield to significant knowledge. In particular, pattern analysis by clustering generic trajectories can give insight into objects sharing the same patterns. Still, sequential clustering approaches fail to handle large volumes of data. Hence, the necessity of distributed systems to be able to infer knowledge in a trivial time interval. In this paper, we detail an efficient, scalable and distributed execution pipeline for clustering raw trajectories. The clustering is achieved via a fuzzy similarity relation obtained by the transitive closure of a proximity relation. Moreover, the pipeline is integrated in Spark, implemented in Scala and leverages the Core and Graphx libraries making use of Resilient Distributed Datasets (RDD) and graph processing. Furthermore, a new simple, but very efficient, partitioning logic has been deployed in Spark and integrated into the execution process. The objective behind this logic is to equally distribute the load among all executors by considering the complexity of the data. In particular, resolving the load balancing issue has reduced the conventional execution time in an important manner. Evaluation and performance of the whole distributed process has been analysed by handling the Geolife project?s GPS trajectory dataset.

 Artículos similares

       
 
Zulfiqar Ahmad, Tayfun Acarer and Wooseong Kim    
To ensure safe, effective, and efficient marine operations, the optimization of maritime communication workflows with a task-oriented scheduling framework is of the utmost importance. Navigation, vessel traffic management, emergency response, and cargo o... ver más

 
Xiaoxiong Liu, Yi Yin, Yuzhan Su and Ruichen Ming    
To solve the problems of autonomous decision making and the cooperative operation of multiple unmanned combat aerial vehicles (UCAVs) in beyond-visual-range air combat, this paper proposes an air combat decision-making method that is based on a multi-age... ver más
Revista: Aerospace

 
Jose I. Aliaga, Maribel Castillo, Sergio Iserte, Iker Martín-Álvarez and Rafael Mayo    
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Performance Computing (HPC) facilities is a cornerstone in the next generation of exascale supercomputers. Process malleability is presented as a straightforwar... ver más
Revista: Applied Sciences

 
Jin Cao, Bo Li, Mengni Fan and Huiyu Liu    
Deep neural network-based computer vision applications have exploded and are widely used in intelligent services for IoT devices. Due to the computationally intensive nature of DNNs, the deployment and execution of intelligent applications in smart scena... ver más
Revista: Algorithms

 
Souhail Meftah, Shuhao Zhang, Bharadwaj Veeravalli and Khin Mi Mi Aung    
The appealing properties of secure hardware solutions such as trusted execution environment (TEE) including low computational overhead, confidentiality guarantee, and reduced attack surface have prompted considerable interest in adopting them for secure ... ver más
Revista: Algorithms