Computers, Vol. 8, Issue 1 (2019)

J48SS: A Novel Decision Tree Approach for the Handling of Sequential and Time Series Data

Andrea Brunello, Enrico Marzano, Angelo Montanari and Guido Sciavicco

Abstract

Temporal information plays an important role in many analysis tasks, and it can be encoded in at least two different ways. It can be modeled as discrete sequences of events, as in the business intelligence domain, where the aim is to track the evolution of customer behavior over time. Alternatively, it can be represented as time series, as in the stock market, where they characterize price histories. In some analysis tasks, temporal information is complemented by other kinds of data, which may be represented by static attributes, e.g., categorical or numerical ones. This paper presents J48SS, a novel decision tree inducer capable of natively mixing static (i.e., numerical and categorical), sequential, and time series data for classification purposes. The algorithm is based on the popular C4.5 decision tree learner, and it relies on the concepts of frequent pattern extraction and time series shapelet generation. It is evaluated on a text classification task in a real business setting, as well as on a selection of public UCR time series datasets. Results show that it provides competitive classification performance, while generating highly interpretable models and effectively reducing the data preparation effort.
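
To make the idea of a shapelet-based split more concrete, the following is a minimal Python sketch of how a decision tree node could test a time series attribute. It is not the J48SS implementation: the abstract does not specify the distance measure or how shapelets are generated, so the sketch assumes the common definition in which the distance between a series and a shapelet is the smallest Euclidean distance over all same-length windows. The names `shapelet_distance` and `split_by_shapelet`, and the threshold value, are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence, Tuple


def shapelet_distance(series: Sequence[float], shapelet: Sequence[float]) -> float:
    """Smallest Euclidean distance between the shapelet and any
    same-length window of the series (assumed definition)."""
    m = len(shapelet)
    if m == 0 or m > len(series):
        raise ValueError("shapelet must be non-empty and no longer than the series")
    best = math.inf
    # Slide the shapelet over every window of length m.
    for start in range(len(series) - m + 1):
        window = series[start:start + m]
        dist = math.sqrt(sum((w - s) ** 2 for w, s in zip(window, shapelet)))
        best = min(best, dist)
    return best


def split_by_shapelet(
    dataset: Sequence[Tuple[Sequence[float], str]],
    shapelet: Sequence[float],
    threshold: float,
) -> Tuple[List[str], List[str]]:
    """Partition the labels of (series, label) pairs by whether the
    series lies within `threshold` of the shapelet."""
    left: List[str] = []   # distance <= threshold
    right: List[str] = []  # distance > threshold
    for series, label in dataset:
        if shapelet_distance(series, shapelet) <= threshold:
            left.append(label)
        else:
            right.append(label)
    return left, right


if __name__ == "__main__":
    data = [
        ([0.0, 0.0, 1.0, 2.0, 1.0, 0.0], "peak"),
        ([0.0, 0.0, 0.0, 0.0, 0.0, 0.0], "flat"),
    ]
    print(split_by_shapelet(data, [1.0, 2.0, 1.0], threshold=1.0))
```

In a C4.5-style learner, a binary test of this kind would be scored with the same criterion used for splits on numerical attributes, which is one way time series and static attributes can coexist in a single tree.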
