Inicio  /  Applied Sciences  /  Vol: 13 Par: 11 (2023)  /  Artículo
ARTÍCULO
TITULO

Improving Reliability for Detecting Anomalies in the MQTT Network by Applying Correlation Analysis for Feature Selection Using Machine Learning Techniques

Imran    
Megat Farez Azril Zuhairi    
Syed Mubashir Ali    
Zeeshan Shahid    
Muhammad Mansoor Alam and Mazliham Mohd Su?ud    

Resumen

Anomaly detection (AD) has captured a significant amount of focus from the research field in recent years, with the rise of the Internet of Things (IoT) application. Anomalies, often known as outliers, are defined as the discovery of anomalous occurrences or observations that differ considerably from the mainstream of the data. The IoT which is described as a network of Internet-based digital sensors that continuously generate massive volumes of data and use to communicate with one another theMessage Queuing Telemetry Transport (MQTT) protocol. Brute-force, Denial-of-Service (DoS), Malformed, Flood, and Slowite attacks are the most common in theMQTT network. One of the significant factors in IoT AD is the time consumed to predict an attack and take preemptive measures. For instance, if an attack is detected late, the loss of attack is irreversible. This paper investigates the time to detect an attack using machine learning approaches and proposes a novel approach that applies correlation analysis to reduce the training and testing time of these algorithms. The new approach has been evaluated on Random Forest, Decision Tree, Naïve Bayes, Multi-Layer Perceptron, Artificial Neural Network, Logistic Regression, and Gradient Boost. The findings indicate that the correlation analysis is significantly beneficial in the process of feature engineering, primarily to determine the most relevant features in the MQTT dataset. This is, to the best of our knowledge, the first study on MQTTset that reduces the prediction time for DoS 0.92 (95% CI -0.378, 2.22) reduced to 0.77 (95% CI -0.414, 1.97) and for Malformed 2.92 (95% CI -2.6, 8.44) reduced to 0.49 (95% CI -0.273, 1.25).

 Artículos similares

       
 
Dario Autovino, Vincenzo Alagna, Cristina Bondì and Massimo Iovino    
Green roofs can be a valid solution for stormwater management in urban environments. The objective of this study was to develop a laboratory procedure for the hydraulic characterization of artificial substrates, used in the realization of green roofs, ba... ver más
Revista: Applied Sciences

 
Leon Kopitar, Iztok Fister, Jr. and Gregor Stiglic    
Introduction: Type 2 diabetes mellitus is a major global health concern, but interpreting machine learning models for diagnosis remains challenging. This study investigates combining association rule mining with advanced natural language processing to im... ver más
Revista: Information

 
Haoxiang Shi, Jun Ai, Jingyu Liu and Jiaxi Xu    
Software defect prediction is a popular method for optimizing software testing and improving software quality and reliability. However, software defect datasets usually have quality problems, such as class imbalance and data noise. Oversampling by genera... ver más
Revista: Applied Sciences

 
Soojeong Lee, Gyanendra Prasad Joshi, Anish Prasad Shrestha, Chang-Hwan Son and Gangseong Lee    
Cuffless blood pressure (BP) monitoring is crucial for patients with cardiovascular disease and hypertension. However, conventional BP monitors provide only single-point estimates without confidence intervals. Therefore, the statistical variability in th... ver más
Revista: Applied Sciences

 
Ming Chen, Kai Huang, Jian Wang, Wenzhi Liu and Yuanyuan Shi    
The reliability of urban transportation systems is crucial for ensuring smooth traffic flow and minimizing disruptions caused by external factors. This study focuses on improving the stability and efficiency of transportation systems through the calibrat... ver más
Revista: Applied Sciences