ARTICLE
TITLE

COLREGs-Based Path Planning for USVs Using the Deep Reinforcement Learning Strategy

Naifeng Wen    
Yundong Long    
Rubo Zhang    
Guanqun Liu    
Wenjie Wan and Dian Jiao    

Abstract

This research introduces a two-stage deep reinforcement learning approach for the cooperative path planning of unmanned surface vehicles (USVs). The method addresses cooperative collision-avoidance path planning that complies with the International Regulations for Preventing Collisions at Sea (COLREGs), covering collision avoidance both within the USV fleet and between USVs and target ships (TSs). To this end, the study presents a dual COLREGs-compliant action-selection strategy for the vessel-avoidance problem. First, a COLREGs-compliant action-evaluation network is constructed: a deep learning network trained on pre-recorded, COLREGs-compliant trajectories of USVs avoiding TSs. Second, a COLREGs-compliant action-selection network based on a reward function is proposed, which accounts for various TS encounter scenarios. The outputs of the two networks are then fused to select actions during cooperative path planning. The path-planning model is built with the multi-agent proximal policy optimization (MAPPO) method, with the action space, observation space, and reward function tailored to the policy network. Additionally, a TS detection method is introduced to detect the motion intentions of TSs. Monte Carlo simulations demonstrate the strong performance of the planning method, and experiments on COLREGs-based TS avoidance validate the feasibility of the approach. The proposed TS detection model exhibited robust performance within the defined task.
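
The abstract states that the outputs of the two networks are fused to select actions but does not give the fusion rule. As a rough illustration only, the sketch below assumes a discrete action set and blends the two score vectors with a weighted average of their softmax distributions. The function names, the `weight` parameter, and the averaging rule itself are hypothetical choices made for this example, not the authors' method.

```python
import math


def softmax(scores):
    """Turn raw scores into a probability distribution (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_action_scores(eval_scores, policy_scores, weight=0.5):
    """Blend two per-action score vectors; return (best_index, fused_probs).

    ASSUMPTION: a convex combination of the two softmax distributions.
    `weight` is a hypothetical mixing coefficient in [0, 1] giving the
    share assigned to the action-evaluation scores.
    """
    if len(eval_scores) != len(policy_scores):
        raise ValueError("both score vectors must cover the same actions")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")

    p_eval = softmax(eval_scores)
    p_policy = softmax(policy_scores)
    fused = [weight * a + (1.0 - weight) * b
             for a, b in zip(p_eval, p_policy)]

    # Greedy choice: index of the action with the highest fused probability.
    best = max(range(len(fused)), key=fused.__getitem__)
    return best, fused


if __name__ == "__main__":
    # Hypothetical actions: 0 = turn to port, 1 = keep course, 2 = turn to starboard.
    action, probs = fuse_action_scores([0.1, 0.2, 2.0], [0.0, 1.0, 1.5])
    print(action, [round(p, 3) for p in probs])
```

With `weight=1.0` the choice depends only on the evaluation scores, and with `weight=0.0` only on the policy scores. Other fusion rules, such as multiplying the two distributions or masking non-compliant actions, would be equally consistent with the abstract.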

Similar articles

       
 
Qing Yang, Bingyu Song, Yingguo Chen, Lei He and Pei Wang    
With the improvement of satellite autonomy, multi-satellite cooperative mission planning has become an important application. This requires multiple satellites to interact with each other via inter-satellite links to reach a consistent mission planning s...
Journal: Algorithms

 
Okan Asik, Fatma Basak Aydemir and Hüseyin Levent Akin    
The number of agents exponentially increases the complexity of a cooperative multi-agent planning problem. Decoupled planning is one of the viable approaches to reduce this complexity. By integrating decoupled planning with Monte Carlo Tree Search, we pr...
Journal: Applied Sciences

 
Jian Zheng, Wenjun Sun, Yun Li and Jiayin Hu    
In order to solve the multi-objective planning and trajectory tracking control problem related to maritime autonomous surface ships (MASSs), a new design scheme for autonomous navigation is proposed in this paper, with a receding horizon navigation and c...

 
Jun Long, Shimin Wu, Xiaodong Han, Yunbo Wang and Limin Liu    
The increasing number of satellites for specific space tasks makes it difficult for traditional satellite task planning that relies on ground station planning and on-board execution to fully exploit the overall effectiveness of satellites. Meanwhile, the...
Journal: Aerospace

 
Bingyu Song, Yingwu Chen, Qing Yang, Yahui Zuo, Shilong Xu and Yuning Chen    
The multi-satellite on-board observation planning (MSOOP) is a variant of the multi-agent task allocation problem (MATAP). MSOOP is used to complete the observation task allocation in a fully cooperative mode to maximize the profits of the whole system. ...
Journal: Algorithms