ARTICLE
TITLE

Feature Selection of Network Intrusion Data using Genetic Algorithm and Particle Swarm Optimization

Iwan Syarif    

Abstract

This paper describes the advantages of using Evolutionary Algorithms (EA) for feature selection on network intrusion datasets. Most current Network Intrusion Detection Systems (NIDS) are unable to detect intrusions in real time because of the high-dimensional data produced during daily operation. Extracting knowledge from very large data such as intrusion data requires a new approach. The more complex the datasets, the higher the computation time and the harder they are to interpret and analyze. This paper investigates the performance of feature selection algorithms on network intrusion data. We used Genetic Algorithms (GA) and Particle Swarm Optimization (PSO) as feature selection algorithms. When applied to network intrusion datasets, both GA and PSO significantly reduce the number of features. Our experiments show that GA reduces the number of attributes from 41 to 15, while PSO reduces it from 41 to 9. Using k-Nearest Neighbour (k-NN) as the classifier, the GA-reduced dataset, which consists of 37% of the original attributes, improves accuracy from 99.28% to 99.70%, and its execution time is 4.8 times faster than that of the original dataset. Using the same classifier, the PSO-reduced dataset, which consists of 22% of the original attributes, has the fastest execution time (7.2 times faster than that of the original dataset). However, its accuracy drops slightly, by 0.02 percentage points, from 99.28% to 99.26%. Overall, both GA and PSO are good feature selection techniques: they reduce the number of features significantly while maintaining, and sometimes improving, classification accuracy and reducing computation time.
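
The GA-based selection summarized above can be illustrated with a small wrapper-style sketch: each candidate solution is a bit mask over the features, and its fitness is the accuracy of a k-NN classifier restricted to the selected features. This is only an illustration under stated assumptions, not the authors' implementation. The toy data (8 features, 2 of them informative), the size penalty in the fitness, the population size, the mutation rate, and every function name below are hypothetical choices made for the example.

```python
import random


def knn_accuracy(train, test, mask, k=3):
    """Holdout accuracy of k-NN using only the features where mask[j] == 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for x, label in test:
        # Squared Euclidean distance restricted to the selected columns.
        nearest = sorted(
            (sum((x[j] - t[j]) ** 2 for j in cols), t_label)
            for t, t_label in train
        )[:k]
        votes = [t_label for _, t_label in nearest]
        predicted = max(set(votes), key=votes.count)  # majority vote
        correct += predicted == label
    return correct / len(test)


def make_fitness(train, test, n_features, penalty=0.01):
    """Fitness = accuracy minus a small penalty per selected feature (assumed form)."""
    cache = {}

    def fitness(mask):
        key = tuple(mask)
        if key not in cache:
            acc = knn_accuracy(train, test, mask)
            cache[key] = acc - penalty * sum(mask) / n_features
        return cache[key]

    return fitness


def ga_select(fitness, n_features, pop_size=12, generations=10, p_mut=0.05, rng=None):
    """Simple GA: tournament selection, one-point crossover, bit-flip mutation, elitism."""
    rng = rng or random.Random(0)
    # Seed the population with the full feature set as a baseline individual.
    pop = [[1] * n_features]
    pop += [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size - 1)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        scored = [(fitness(ind), ind) for ind in pop]

        def tournament():
            a, b = rng.sample(scored, 2)
            return a[1] if a[0] >= b[0] else b[1]

        children = [best[:]]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randrange(1, n_features)
            child = p1[:cut] + p2[cut:]
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            children.append(child)
        pop = children
        best = max(pop, key=fitness)
    return best


def toy_data(rng, n_samples, n_features):
    """Hypothetical data: features 0 and 1 carry the label, the rest are noise."""
    rows = []
    for _ in range(n_samples):
        label = rng.randint(0, 1)
        x = [rng.gauss(0, 2) for _ in range(n_features)]
        x[0] = 2 * label + rng.gauss(0, 0.5)
        x[1] = -2 * label + rng.gauss(0, 0.5)
        rows.append((x, label))
    return rows


rng = random.Random(42)
N_FEATURES = 8
train, test = toy_data(rng, 40, N_FEATURES), toy_data(rng, 20, N_FEATURES)
fitness = make_fitness(train, test, N_FEATURES)
best = ga_select(fitness, N_FEATURES, rng=rng)
print("selected features:", [j for j, bit in enumerate(best) if bit])
print("fitness (full set vs. selected):", fitness([1] * N_FEATURES), fitness(best))
```

Because the full feature set is placed in the initial population and the best mask is always carried over, the selected subset never scores worse than the full set under this fitness. A PSO variant would keep the same fitness function and replace only the search loop.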
