ARTÍCULO
TITULO

An AUV Target-Tracking Method Combining Imitation Learning and Deep Reinforcement Learning

Yubing Mao    
Farong Gao    
Qizhong Zhang and Zhangyi Yang    

Resumen

This study aims to solve the problem of sparse reward and local convergence when using a reinforcement learning algorithm as the controller of an AUV. Based on the generative adversarial imitation (GAIL) algorithm combined with a multi-agent, a multi-agent GAIL (MAG) algorithm is proposed. The GAIL enables the AUV to directly learn from expert demonstrations, overcoming the difficulty of slow initial training of the network. Parallel training of multi-agents reduces the high correlation between samples to avoid local convergence. In addition, a reward function is designed to help training. Finally, the results show that in the unity simulation platform test, the proposed algorithm has a strong optimal decision-making ability in the tracking process.

 Artículos similares

       
 
Jiqing Du, Dan Zhou, Wei Wang and Sachiyo Arai    
The Deep Reinforcement Learning (DRL) algorithm is an optimal control method with generalization capacity for complex nonlinear coupled systems. However, the DRL agent maintains control command saturation and response overshoot to achieve the fastest res... ver más

 
Dianrui Wang, Junhe Wan, Yue Shen, Ping Qin and Bo He    
An accurate mathematical model is a basis for controlling and estimating the state of an Autonomous underwater vehicle (AUV) system, so how to improve its accuracy is a fundamental problem in the field of automatic control. However, AUV systems are compl... ver más

 
Pengcheng Mu, Xin Zhang, Ping Qin and Bo He    
Simultaneous Localization and Mapping (SLAM) is a well-known solution for mapping and realizing autonomous navigation of an Autonomous Underwater Vehicle (AUV) in unknown underwater environments. However, the inaccurate time-varying observation noise wil... ver más

 
Yushan Sun, Xiaokun Luo, Xiangrui Ran and Guocheng Zhang    
This research aims to solve the safe navigation problem of autonomous underwater vehicles (AUVs) in deep ocean, which is a complex and changeable environment with various mountains. When an AUV reaches the deep sea navigation, it encounters many underwat... ver más

 
Timothy Sands    
The major premise of deterministic artificial intelligence (D.A.I.) is to assert deterministic self-awareness statements based in either the physics of the underlying problem or system identification to establish governing differential equations. The key... ver más