REVISTA
AI

   
Inicio  /  AI  /  Vol: 3 Par: 2 (2022)  /  Artículo
ARTÍCULO
TITULO

Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

Charl Maree and Christian Omlin    

Resumen

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents? behaviour. In this study, we have instead developed a method to imbue agents? policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents? behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers? investment portfolios based on their spending personalities.

 Artículos similares

       
 
Jin Wang, Peng Zhao, Zhe Zhang, Ting Yue, Hailiang Liu and Lixin Wang    
The upset state is an unexpected flight state, which is characterized by an unintentional deviation from normal operating parameters. It is difficult for the pilot to recover the aircraft from the upset state accurately and quickly. In this paper, an ups... ver más
Revista: Aerospace

 
Bocheng Zhao, Mingying Huo, Ze Yu, Naiming Qi and Jianfeng Wang    
In this study, we propose an aerial rendezvous method to facilitate the recovery of unmanned aerial vehicles (UAVs) using carrier aircrafts, which is an important capability for the future use of UAVs. The main contribution of this study is the developme... ver más
Revista: Aerospace

 
Bohdan Petryshyn, Serhii Postupaiev, Soufiane Ben Bari and Armantas Ostreika    
The development of autonomous driving models through reinforcement learning has gained significant traction. However, developing obstacle avoidance systems remains a challenge. Specifically, optimising path completion times while navigating obstacles is ... ver más
Revista: Information

 
Yu-Hung Chang, Chien-Hung Liu and Shingchern D. You    
The dynamic flexible job-shop problem (DFJSP) is a realistic and challenging problem that many production plants face. As the product line becomes more complex, the machines may suddenly break down or resume service, so we need a dynamic scheduling frame... ver más
Revista: Information

 
Sungwon Moon, Seolwon Koo, Yujin Lim and Hyunjin Joo    
With recent technological advancements, the commercialization of autonomous vehicles (AVs) is expected to be realized soon. However, it is anticipated that a mixed traffic of AVs and human-driven vehicles (HVs) will persist for a considerable period unti... ver más
Revista: Applied Sciences