REVISTA
Algorithms

TODAS

Redirigiendo al acceso original de articulo en 22 segundos...

Inicio / Algorithms / Vol: 17 Par: 1 (2024) / Art�culo

ART�CULO

TITULO

Reducing Q-Value Estimation Bias via Mutual Estimation and Softmax Operation in MADRL

Zheng Li

Xinkai Chen

Jiaqing Fu

Ning Xie and Tingting Zhao

Resumen

With the development of electronic game technology, the content of electronic games presents a larger number of units, richer unit attributes, more complex game mechanisms, and more diverse team strategies. Multi-agent deep reinforcement learning shines brightly in this type of team electronic game, achieving results that surpass professional human players. Reinforcement learning algorithms based on Q-value estimation often suffer from Q-value overestimation, which may seriously affect the performance of AI in multi-agent scenarios. We propose a multi-agent mutual evaluation method and a multi-agent softmax method to reduce the estimation bias of Q values in multi-agent scenarios, and have tested them in both the particle multi-agent environment and the multi-agent tank environment we constructed. The multi-agent tank environment we have built has achieved a good balance between experimental verification efficiency and multi-agent game task simulation. It can be easily extended for different multi-agent cooperation or competition tasks. We hope that it can be promoted in the research of multi-agent deep reinforcement learning.

Palabras claves

reinforcement learning - game AI - multi-agent Q-network mutual estimation - softmax bellman operation - reinforcement learning environment

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 17 Parte: 1 (2024)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Applied System Innovation
Applied Sciences
Information

DOI

https://doi.org/10.3390/a17010036

Art�culos similares

Reliability Estimation for the Joint Waterproof Facilities of Utility Tunnels Based on an Improved Bayesian Weibull Model

Acceso

Fang-Le Peng, Yong-Kang Qiao and Chao Yang

Safety issues are a major concern for the long-term maintenance and operation of utility tunnels, of which the focal point lies in the reliability of critical facilities. Conventional evaluation methods have failed to reflect the time-dependency and obje... ver m�s

Revista: Applied Sciences

A Deep-Sea Broadband Sound Source Depth Estimation Method Based on the Interference Structure of the Compensated Beam Output

Acceso

Yan Liang, Yu Chen, Zhou Meng, Xin Zhou and Yichi Zhang

This paper proposes an underwater broadband target depth estimation method based on the multipath arrival structure in medium and short-range deep-sea environments. The proposed approach involves separating the multipath rays arriving at the vertical lin... ver m�s

Revista: Journal of Marine Science and Engineering

Engineering Supply Chain Transportation Indexes through Big Data Analytics and Deep Learning

Acceso

Damianos P. Sakas, Nikolaos T. Giannakopoulos, Marina C. Terzi and Nikos Kanellos

Deep learning has experienced an increased demand for its capabilities to categorize and optimize operations and provide higher-accuracy information. For this purpose, the implication of deep learning procedures has been described as a vital tool for the... ver m�s

Revista: Applied Sciences

Rendezvous and Proximity Operations in Cislunar Space Using Linearized Dynamics for Estimation

Acceso

David Zuehlke, Madhur Tiwari, Khalid Jebari and Krishna Bhavithavya Kidambi

As interest in Moon exploration grows, and efforts to establish an orbiting outpost intensify, accurate modeling of spacecraft dynamics in cislunar space is becoming increasingly important. Contrary to satellites in Low Earth Orbit (LEO), where it takes ... ver m�s

Revista: Aerospace

Integrating GRU with a Kalman Filter to Enhance Visual Inertial Odometry Performance in Complex Environments

Acceso

Tarafder Elmi Tabassum, Zhengjia Xu, Ivan Petrunin and Zeeshan A. Rana

To enhance system reliability and mitigate the vulnerabilities of the Global Navigation Satellite Systems (GNSS), it is common to fuse the Inertial Measurement Unit (IMU) and visual sensors with the GNSS receiver in the navigation system design, effectiv... ver m�s

Revista: Aerospace

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles