Inicio  /  Algorithms  /  Vol: 14 Par: 12 (2021)  /  Artículo
ARTÍCULO
TITULO

Hexadecimal Aggregate Approximation Representation and Classification of Time Series Data

Zhenwen He    
Chunfeng Zhang    
Xiaogang Ma and Gang Liu    

Resumen

Time series data are widely found in finance, health, environmental, social, mobile and other fields. A large amount of time series data has been produced due to the general use of smartphones, various sensors, RFID and other internet devices. How a time series is represented is key to the efficient and effective storage and management of time series data, as well as being very important to time series classification. Two new time series representation methods, Hexadecimal Aggregate approXimation (HAX) and Point Aggregate approXimation (PAX), are proposed in this paper. The two methods represent each segment of a time series as a transformable interval object (TIO). Then, each TIO is mapped to a spatial point located on a two-dimensional plane. Finally, the HAX maps each point to a hexadecimal digit so that a time series is converted into a hex string. The experimental results show that HAX has higher classification accuracy than Symbolic Aggregate approXimation (SAX) but a lower one than some SAX variants (SAX-TD, SAX-BD). The HAX has the same space cost as SAX but is lower than these variants. The PAX has higher classification accuracy than HAX and is extremely close to the Euclidean distance (ED) measurement; however, the space cost of PAX is generally much lower than the space cost of ED. HAX and PAX are general representation methods that can also support geoscience time series clustering, indexing and query except for classification.

Palabras claves

time series -  SAX -  PAA -  HAX -  PAX

 Artículos similares

       
 
Yong Zhang, Xin Wang, Zongli Jiang, Junfeng Wei, Hiroyuki Enomoto and Tetsuo Ohata    
Arctic glaciers comprise a small fraction of the world?s land ice area, but their ongoing mass loss currently represents a large cryospheric contribution to the sea level rise. In the Suntar-Khayata Mountains (SKMs) of northeastern Siberia, in situ measu... ver más
Revista: Water

 
Jianzhao Liu, Liping Gao, Fenghui Yuan, Yuedong Guo and Xiaofeng Xu    
Soil water shortage is a critical issue for the Southwest US (SWUS), the typical arid region that has experienced severe droughts over the past decades, primarily caused by climate change. However, it is still not quantitatively understood how soil water... ver más
Revista: Water

 
Angel E. Muñoz-Zavala, Jorge E. Macías-Díaz, Daniel Alba-Cuéllar and José A. Guerrero-Díaz-de-León    
This paper reviews the application of artificial neural network (ANN) models to time series prediction tasks. We begin by briefly introducing some basic concepts and terms related to time series analysis, and by outlining some of the most popular ANN arc... ver más
Revista: Algorithms

 
Dimitris Fotakis, Panagiotis Patsilinakos, Eleni Psaroudaki and Michalis Xefteris    
In this work, we consider the problem of shape-based time-series clustering with the widely used Dynamic Time Warping (DTW) distance. We present a novel two-stage framework based on Sparse Gaussian Modeling. In the first stage, we apply Sparse Gaussian P... ver más
Revista: Algorithms

 
Nicholas V. Sarlis, Efthimios S. Skordas, Stavros-Richard G. Christopoulos and Panayiotis K. Varotsos    
Here, we employ natural time analysis of seismicity together with non-extensive statistical mechanics aiming at shortening the occurrence time window of the Kahramanmaras-Gazientep M7.8 earthquake. The results obtained are in the positive direction point... ver más
Revista: Applied Sciences