Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

Charl Maree and Christian Omlin

Resumen

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents? behaviour. In this study, we have instead developed a method to imbue agents? policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents? behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers? investment portfolios based on their spending personalities.

Palabras claves

explainable AI - multi-agent systems - deterministic policy gradients

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 3 Parte: 2 (2022)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Algorithms
Aerospace
Information

DOI

https://doi.org/10.3390/ai3020015

Art�culos similares

Aircraft Upset Recovery Strategy and Pilot Assistance System Based on Reinforcement Learning

Acceso

Jin Wang, Peng Zhao, Zhe Zhang, Ting Yue, Hailiang Liu and Lixin Wang

The upset state is an unexpected flight state, which is characterized by an unintentional deviation from normal operating parameters. It is difficult for the pilot to recover the aircraft from the upset state accurately and quickly. In this paper, an ups... ver m�s

Revista: Aerospace

Model-Reference Reinforcement Learning for Safe Aerial Recovery of Unmanned Aerial Vehicles

Acceso

Bocheng Zhao, Mingying Huo, Ze Yu, Naiming Qi and Jianfeng Wang

In this study, we propose an aerial rendezvous method to facilitate the recovery of unmanned aerial vehicles (UAVs) using carrier aircrafts, which is an important capability for the future use of UAVs. The main contribution of this study is the developme... ver m�s

Revista: Aerospace

Deep Reinforcement Learning for Autonomous Driving in Amazon Web Services DeepRacer

Acceso

Bohdan Petryshyn, Serhii Postupaiev, Soufiane Ben Bari and Armantas Ostreika

The development of autonomous driving models through reinforcement learning has gained significant traction. However, developing obstacle avoidance systems remains a challenge. Specifically, optimising path completion times while navigating obstacles is ... ver m�s

Revista: Information

Scheduling for the Flexible Job-Shop Problem with a Dynamic Number of Machines Using Deep Reinforcement Learning

Acceso

Yu-Hung Chang, Chien-Hung Liu and Shingchern D. You

The dynamic flexible job-shop problem (DFJSP) is a realistic and challenging problem that many production plants face. As the product line becomes more complex, the machines may suddenly break down or resume service, so we need a dynamic scheduling frame... ver m�s

Revista: Information

Routing Control Optimization for Autonomous Vehicles in Mixed Traffic Flow Based on Deep Reinforcement Learning

Acceso

Sungwon Moon, Seolwon Koo, Yujin Lim and Hyunjin Joo

With recent technological advancements, the commercialization of autonomous vehicles (AVs) is expected to be realized soon. However, it is anticipated that a mixed traffic of AVs and human-driven vehicles (HVs) will persist for a considerable period unti... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles