Algorithms, Vol. 17, No. 1 (2024)
ARTICLE
TITLE

Framework Based on Simulation of Real-World Message Streams to Evaluate Classification Solutions

Wenny Hojas-Mazo    
Francisco Maciá-Pérez    
José Vicente Berná Martínez    
Mailyn Moreno-Espino    
Iren Lorenzo Fonseca    
Juan Pavón    

Abstract

Analysing message streams in a dynamic environment is challenging. Various methods and metrics are used to evaluate message classification solutions, but they often fail to realistically simulate the actual environment. As a result, evaluations can produce overly optimistic results that do not hold in real-world environments. This paper proposes a framework based on the simulation of real-world message streams to evaluate classification solutions. The framework consists of four modules: message stream simulation, processing, classification and evaluation. The simulation module uses techniques and queueing theory to replicate a real-world message stream. The processing module refines the input messages for optimal classification. The classification module categorises the generated message stream using existing solutions. The evaluation module measures the performance of the classification solutions in terms of accuracy, precision and recall. The framework can model the distinct behaviour of each source, such as spammers with different attack strategies, press media or social network sources. Each source profile generates its own message stream, and these streams are combined into the main stream for greater realism. A spam detection case study demonstrates the implementation of the proposed framework and identifies latency and message body obfuscation as critical parameters for classification quality.
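
The idea of per-profile streams merged into one main stream, scored by accuracy, precision and recall, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each profile emits messages as a Poisson process (exponential inter-arrival times), which is one common queueing-theory model; the abstract does not say which model the paper uses. The profile names, rates and function names are hypothetical.

```python
import heapq
import random


def simulate_profile(name, rate, label, horizon, rng):
    """Yield (timestamp, source, label) tuples for one source profile.

    Assumption: arrivals follow a Poisson process with `rate` messages
    per time unit, so inter-arrival times are exponentially distributed.
    """
    t = rng.expovariate(rate)
    while t < horizon:
        yield (t, name, label)
        t += rng.expovariate(rate)


def merge_streams(streams):
    """Combine per-profile streams into one stream ordered by timestamp."""
    return list(heapq.merge(*streams))


def evaluate(y_true, y_pred, positive="spam"):
    """Compute accuracy, precision and recall for a binary labelling."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    total = len(pairs)
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


if __name__ == "__main__":
    rng = random.Random(42)
    # Hypothetical profiles: one aggressive spammer, one press source.
    main_stream = merge_streams([
        simulate_profile("spammer_a", 2.0, "spam", 10.0, rng),
        simulate_profile("press_feed", 1.0, "ham", 10.0, rng),
    ])
    y_true = [label for _, _, label in main_stream]
    # Placeholder classifier that flags every message as spam.
    y_pred = ["spam"] * len(y_true)
    print(len(main_stream), evaluate(y_true, y_pred))
```

Merging with `heapq.merge` keeps each profile lazy and independent, so further profiles with other rates or attack strategies can be added without changing the evaluation step.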
