Inicio  /  Drones  /  Vol: 7 Par: 1 (2023)  /  Artículo
ARTÍCULO
TITULO

PPO-Exp: Keeping Fixed-Wing UAV Formation with Deep Reinforcement Learning

Dan Xu    
Yunxiao Guo    
Zhongyi Yu    
Zhenfeng Wang    
Rongze Lan    
Runhao Zhao    
Xinjia Xie and Han Long    

Resumen

Flocking for fixed-Wing Unmanned Aerial Vehicles (UAVs) is an extremely complex challenge due to fixed-wing UAV?s control problem and the system?s coordinate difficulty. Recently, flocking approaches based on reinforcement learning have attracted attention. However, current methods also require that each UAV makes the decision decentralized, which increases the cost and computation of the whole UAV system. This paper researches a low-cost UAV formation system consisting of one leader (equipped with the intelligence chip) with five followers (without the intelligence chip), and proposes a centralized collision-free formation-keeping method. The communication in the whole process is considered and the protocol is designed by minimizing the communication cost. In addition, an analysis of the Proximal Policy Optimization (PPO) algorithm is provided; the paper derives the estimation error bound, and reveals the relationship between the bound and exploration. To encourage the agent to balance their exploration and estimation error bound, a version of PPO named PPO-Exploration (PPO-Exp) is proposed. It can adjust the clip constraint parameter and make the exploration mechanism more flexible. The results of the experiments show that PPO-Exp performs better than the current algorithms in these tasks.

 Artículos similares

       
 
Tao Jin, Wen Zhang, Chunlai Chen, Bin Chen, Yizhou Zhuang and He Zhang    
Deep-learning- and unmanned aerial vehicle (UAV)-based methods facilitate structural crack detection for tall structures. However, contemporary datasets are generally established using images taken with handheld or vehicle-mounted cameras. Thus, these im... ver más
Revista: Buildings

 
Enrique Aldao, Luis M. González-deSantos, Humberto Michinel and Higinio González-Jorge    
In this work, a real-time collision avoidance algorithm was presented for autonomous navigation in the presence of fixed and moving obstacles in building environments. The current implementation is designed for autonomous navigation between waypoints of ... ver más
Revista: Drones

 
Paulius Kavaliauskas, Daumantas ?idanavicius and Andrius Jurelionis    
Current surveying techniques are typically applied to survey the as-is condition of buildings, brownfield sites and infrastructure prior to design. However, within the past decade, these techniques evolved significantly, and their applications can be enh... ver más

 
Aggeliki Kyriou, Konstantinos Nikolakopoulos and Ioannis Koukouvelas    
The detailed and accurate mapping of landscapes and their geomorphological characteristics is a key issue in hazard management. The current study examines whether the image acquisition geometry of unmanned aerial vehicle (UAV) campaigns affects the accur... ver más

 
Xueyan Zhang    
China?s rural population has declined markedly with the acceleration of urbanization and industrialization, but the area under rural homesteads has continued to expand. Proper rural land use and management require large-scale, efficient, and low-cost rur... ver más