ARTICLE
TITLE

Research on Method of Collision Avoidance Planning for UUV Based on Deep Reinforcement Learning

Wei Gao, Mengxue Han, Zhao Wang, Lihui Deng, Hongjian Wang and Jingfei Ren

Abstract

A UUV can perform tasks such as underwater surveillance, reconnaissance, and tracking when equipped with sensors and task-specific modules. Because the underwater environment is complex, a UUV needs a reliable collision avoidance planning algorithm to avoid obstacles while performing these tasks. Existing path planning algorithms take a long time to plan and adapt poorly to the environment. Some collision avoidance planning algorithms also ignore the kinematic limitations of the UUV, which places high demands on the vehicle's performance and control algorithms. This article proposes PPO-DWA, a collision avoidance planning algorithm for a UUV among static, unknown obstacles, based on the proximal policy optimization (PPO) algorithm and the dynamic window approach (DWA). The algorithm takes obstacle information from forward-looking sonar as input and outputs the corresponding continuous actions. It consists of two parts: a PPO planner and a modified DWA. The PPO planner outputs only the continuous angular velocity, which reduces the difficulty of training the neural networks. The modified DWA takes the obstacle information and the angular velocity chosen by the PPO planner as input and outputs the linear velocity. The resulting collision avoidance actions satisfy the kinematic constraints of the UUV, and the execution time is relatively short. Experimental data show that the PPO-DWA algorithm can plan smooth, collision-free paths in complex obstacle environments with acceptable execution time.
