Algorithms, Vol. 16, No. 6 (2023)

Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift

Zhehu Yuan, Yinqi Sun and Dennis Shasha

Abstract

Database and data structure research can improve machine learning performance in many ways. One way is to design better algorithms on data structures. This paper combines incremental computation with sequential and probabilistic filtering to enable "forgetful" tree-based learning algorithms to cope with streaming data that suffers from concept drift. (Concept drift occurs when the functional mapping from input to classification changes over time.) The forgetful algorithms described in this paper achieve high performance while maintaining high-quality predictions on streaming data. Specifically, the algorithms are up to 24 times faster than state-of-the-art incremental algorithms with at most a 2% loss of accuracy, or at least twice as fast without any loss of accuracy. This makes such structures suitable for high-volume streaming applications.
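To make the notion of concept drift concrete, the following is a minimal illustrative sketch and not the algorithms from the paper. It assumes a toy stream in which the label equals the input before the drift and is flipped afterwards, and it compares a learner that keeps every example with one that retains only the newest examples. The names `WindowedMajority`, `max_items`, and `late_accuracy` are hypothetical and chosen only for this illustration.

```python
from collections import Counter, deque


class WindowedMajority:
    """Toy learner (hypothetical, not the paper's method).

    Predicts the most common label seen so far for each input value.
    max_items=None keeps every example; an integer keeps only the
    newest max_items examples and forgets the rest.
    """

    def __init__(self, max_items=None):
        self.max_items = max_items
        self.window = deque()   # retained (x, y) examples, oldest first
        self.counts = {}        # input value -> Counter of labels

    def predict(self, x):
        labels = self.counts.get(x)
        # An empty or missing Counter means nothing is known: default to 0.
        return labels.most_common(1)[0][0] if labels else 0

    def learn(self, x, y):
        self.window.append((x, y))
        self.counts.setdefault(x, Counter())[y] += 1
        if self.max_items is not None and len(self.window) > self.max_items:
            # Forget the oldest example by adjusting the counts in place,
            # so no retraining from scratch is needed.
            old_x, old_y = self.window.popleft()
            self.counts[old_x][old_y] -= 1
            if self.counts[old_x][old_y] == 0:
                del self.counts[old_x][old_y]


def late_accuracy(model, n=400, drift_at=200, eval_from=300):
    """Predict-then-learn on a toy stream; score only the late items."""
    correct = 0
    for t in range(n):
        x = t % 2
        # Concept drift: the input-to-label mapping flips at drift_at.
        y = x if t < drift_at else 1 - x
        if t >= eval_from and model.predict(x) == y:
            correct += 1
        model.learn(x, y)
    return correct / (n - eval_from)


if __name__ == "__main__":
    print("forgetful  :", late_accuracy(WindowedMajority(max_items=50)))
    print("full memory:", late_accuracy(WindowedMajority(max_items=None)))
```

On this toy stream the windowed learner scores 1.0 on the late items, while the full-memory learner scores 0.0 because pre-drift examples still outvote the newer ones. A fixed window is only one simple way to forget, and the window size of 50 is an arbitrary choice for the illustration.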
