REVISTA
Applied Sciences

TODAS

Inicio / Applied Sciences / Vol: 10 Par: 19 (2020) / Art�culo

ART�CULO

TITULO

Self-Adaptive Priority Correction for Prioritized Experience Replay

Hongjie Zhang

Cheng Qu

Jindou Zhang and Jing Li

Resumen

Deep Reinforcement Learning (DRL) is a promising approach for general artificial intelligence. However, most DRL methods suffer from the problem of data inefficiency. To alleviate this problem, DeepMind proposed Prioritized Experience Replay (PER). Though PER improves data utilization, the priorities of most samples in its Experience Memory (EM) are out of date, as only the priorities of a small part of the data are updated while the Q network parameters are updated. Consequently, the difference between storage and real priority distributions gradually increases, which will introduce bias into the gradients of Deep Q-Learning (DQL) and make the DQL update toward a non-ideal direction. In this work, we propose a novel self-adaptive priority correction algorithm named Importance-PER (Imp-PER) to fix the update deviation. Specifically, we predict the sum of real Temporal-Difference error (TD-error) of all data in EM. Data are corrected by an importance weight, which is estimated by the predicted sum and the real TD-error calculated by the latest agent. To control the unbounded importance weight, we use truncated importance sampling with a self-adaptive truncation threshold. The conducted experiments on various games of Atari 2600 with Double Deep Q-Network and MuJoCo with Deep Deterministic Policy Gradient demonstrate that Imp-PER improves the data utilization and final policy quality on discrete states and continuous states tasks without increasing the computational cost.

Palabras claves

deep reinforcement learning - experience replay - importance sampling - DDQN - DDPG

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 10 Parte: 19 (2020)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Water
Journal of Science and Applicative Technology
Journal of Marine Science and Engineering

DOI

https://doi.org/10.3390/app10196925

Art�culos similares

Effect of Data Augmentation on Deep-Learning-Based Segmentation of Long-Axis Cine-MRI

Acceso

Fran�ois Legrand, Richard Macwan, Alain Lalande, Lisa M�tairie and Thomas Decourselle

Automated Cardiac Magnetic Resonance segmentation serves as a crucial tool for the evaluation of cardiac function, facilitating faster clinical assessments that prove advantageous for both practitioners and patients alike. Recent studies have predominant... ver m�s

Revista: Algorithms

A Deep Learning Approach for Trajectory Control of Tilt-Rotor UAV

Acceso

Javensius Sembiring, Rianto Adhy Sasongko, Eduardo I. Bastian, Bayu Aji Raditya and Rayhan Ekananto Limansubroto

This paper investigates the development of a deep learning-based flight control model for a tilt-rotor unmanned aerial vehicle, focusing on altitude, speed, and roll hold systems. Training data is gathered from the X-Plane flight simulator, employing a p... ver m�s

Revista: Aerospace

Transport Infrastructure Management Based on LiDAR Synthetic Data: A Deep Learning Approach with a ROADSENSE Simulator

Acceso

Lino Comesa�a-Cebral, Joaqu�n Mart�nez-S�nchez, Ant�n Nu�ez Seoane and Pedro Arias

Revista: Infrastructures

Overlay-ML: Unioning Memory and Storage Space for On-Device AI on Mobile Devices

Acceso

Cheolhyeon Kwon and Donghyun Kang

Recently, the technologies of on-device AI have been accelerated with the development of new hardware and software platforms. Therefore, many researchers and engineers focus on how to enable ML technologies on mobile devices with limited hardware resourc... ver m�s

Revista: Applied Sciences

Developing a Framework for Data-Driven Generation of Building Information Modeling from Sketches: Enhancing Efficiency in Space Configuration and Building Performance Analysis

Acceso

WoonSeong Jeong, ByungChan Kong and Sang-Guk Yum

The demand for compact housing is on the rise, driven by the need for floor plans that accommodate stakeholders? preferences. However, clients frequently struggle to convey their spatial needs to professionals, such as architects, due to a lack of means ... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas