Redirigiendo al acceso original de articulo en 23 segundos...
Inicio  /  Algorithms  /  Vol: 17 Par: 2 (2024)  /  Artículo
ARTÍCULO
TITULO

Efficient Time-Series Clustering through Sparse Gaussian Modeling

Dimitris Fotakis    
Panagiotis Patsilinakos    
Eleni Psaroudaki and Michalis Xefteris    

Resumen

In this work, we consider the problem of shape-based time-series clustering with the widely used Dynamic Time Warping (DTW) distance. We present a novel two-stage framework based on Sparse Gaussian Modeling. In the first stage, we apply Sparse Gaussian Process Regression and obtain a sparse representation of each time series in the dataset with a logarithmic (in the original length T) number of inducing data points. In the second stage, we apply k-means with DTW Barycentric Averaging (DBA) to the sparsified dataset using a generalization of DTW, which accounts for the fact that each inducing point serves as a representative of many original data points. The asymptotic running time of our Sparse Time-Series Clustering framework is Ω(T2/log2T)" role="presentation">O(??2/log2??)O(T2/log2T) O ( T 2 / log 2 T ) times faster than the running time of applying k-means to the original dataset because sparsification reduces the running time of DTW from Θ(T2)" role="presentation">T(??2)T(T2) T ( T 2 ) to Θ(log2T)" role="presentation">T(log2??)T(log2T) T ( log 2 T ) . Moreover, sparsification tends to smoothen outliers and particularly noisy parts of the original time series. We conduct an extensive experimental evaluation using datasets from the UCR Time-Series Classification Archive, showing that the quality of clustering computed by our Sparse Time-Series Clustering framework is comparable to the clustering computed by the standard k-means algorithm.

 Artículos similares

       
 
Louis Closson, Christophe Cérin, Didier Donsez and Jean-Luc Baudouin    
This paper aims to provide discernment toward establishing a general framework, dedicated to data analysis and forecasting in smart buildings. It constitutes an industrial return of experience from an industrialist specializing in IoT supported by the ac... ver más
Revista: Information

 
Natalí Carbo-Bustinza, Hasnain Iftikhar, Marisol Belmonte, Rita Jaqueline Cabello-Torres, Alex Rubén Huamán De La Cruz and Javier Linkolk López-Gonzales    
In the modern era, air pollution is one of the most harmful environmental issues on the local, regional, and global stages. Its negative impacts go far beyond ecosystems and the economy, harming human health and environmental sustainability. Given these ... ver más
Revista: Applied Sciences

 
Naoki Nishimura, Noriyoshi Sukegawa, Yuichi Takano and Jiro Iwanaga    
This paper examines the relationship between user pageview (PV) histories and their itemchoice behavior on an e-commerce website. We focus on PV sequences, which represent time series of the number of PVs for each user?item pair. We propose a shape-restr... ver más
Revista: Algorithms

 
Deokhwan Kim, Cheolhee Jang, Jeonghyeon Choi and Jaewon Kwak    
As a significant portion of the available water resources in volcanic terrains such as Jeju Island are dependent on groundwater, reliable groundwater level forecasting is one of the important tasks for efficient water resource management. This study aims... ver más
Revista: Water

 
Shiva Gopal Shrestha and Soni M. Pradhanang    
The general practice of rainfall-runoff model development towards physically based and spatially explicit representations of hydrological processes is data-intensive and computationally expensive. Physically based models such as the Soil Water Assessment... ver más
Revista: Water