ARTÍCULO
TITULO

Pareto Optimal Solutions for Network Defense Strategy Selection Simulator in Multi-Objective Reinforcement Learning

Yang Sun    
Yun Li    
Wei Xiong    
Zhonghua Yao    
Krishna Moniz and Ahmed Zahir    

Resumen

Using Pareto optimization in Multi-Objective Reinforcement Learning (MORL) leads to better learning results for network defense games. This is particularly useful for network security agents, who must often balance several goals when choosing what action to take in defense of a network. If the defender knows his preferred reward distribution, the advantages of Pareto optimization can be retained by using a scalarization algorithm prior to the implementation of the MORL. In this paper, we simulate a network defense scenario by creating a multi-objective zero-sum game and using Pareto optimization and MORL to determine optimal solutions and compare those solutions to different scalarization approaches. We build a Pareto Defense Strategy Selection Simulator (PDSSS) system for assisting network administrators on decision-making, specifically, on defense strategy selection, and the experiment results show that the Satisficing Trade-Off Method (STOM) scalarization approach performs better than linear scalarization or GUESS method. The results of this paper can aid network security agents attempting to find an optimal defense policy for network security games.

 Artículos similares

       
 
Na Wei, Yuxin Peng, Kunming Lu, Guixing Zhou, Xingtao Guo and Minghui Niu    
The parallel reservoirs in the upper reach of the Hanjiang River are key projects for watershed management, development, and protection. The optimal operation of parallel reservoirs is a multiple-stage, multiple-objective, and multiple-decision attribute... ver más
Revista: Applied Sciences

 
Jafar Jafari-Asl, Seyed Arman Hashemi Monfared and Soroush Abolfathi    
This study investigates the optimal and safe operation of pumping stations in water distribution systems (WDSs) with the aim of reducing the environmental footprint of water conveyance processes. We introduced the nonlinear chaotic honey badger algorithm... ver más
Revista: Water

 
Mansoor Davoodi and Justin M. Calabrese    
The optimal placement of healthcare facilities, including the placement of diagnostic test centers, plays a pivotal role in ensuring efficient and equitable access to healthcare services. However, the emergence of unique complexities in the context of a ... ver más
Revista: Algorithms

 
Vedat Dogan and Steven Prestwich    
In a multi-objective optimization problem, a decision maker has more than one objective to optimize. In a bilevel optimization problem, there are the following two decision-makers in a hierarchy: a leader who makes the first decision and a follower who r... ver más
Revista: Algorithms

 
Lu Sun, Bao Zhang, Ping Wang, Zhihong Gan, Pengpeng Han and Yijian Wang    
The process of intelligent multi-objective parametric optimization design for mirrors is discussed in detail in this paper, with the error of the mirror surface shape and the total mass being examined as the optimization objectives. The establishment of ... ver más
Revista: Applied Sciences