ARTÍCULO
TITULO

Reference Model-Based Deterministic Policy for Pitch and Depth Control of Autonomous Underwater Vehicle

Jiqing Du    
Dan Zhou    
Wei Wang and Sachiyo Arai    

Resumen

The Deep Reinforcement Learning (DRL) algorithm is an optimal control method with generalization capacity for complex nonlinear coupled systems. However, the DRL agent maintains control command saturation and response overshoot to achieve the fastest response. In this study, a reference model-based DRL control strategy termed Model-Reference Twin Delayed Deep Deterministic (MR-TD3) was proposed for controlling the pitch attitude and depth of an autonomous underwater vehicle (AUV) system. First, a reference model based on an actual AUV system was introduced to an actor?critic structure, where the input of the model was the reference target, the outputs were the smoothed reference targets, and the reference model parameters can adjust the response time and the smoothness. The input commands were limited to the saturation range. Then, the model state, the real state and the reference target were mapped to the control command through the Twin Delayed Deep Deterministic (TD3) agent for training. Finally, the trained neural network was applied to the AUV system environment for pitch and depth experiments. The results demonstrated that the controller can eliminate the response overshoot and control command saturation while improving the robustness, and the method also can extend to other control platforms such as autonomous guided vehicle or unmanned aerial vehicle.

 Artículos similares

       
 
Yulong Liu, Shuxian Liu and Juepu Chen    
Accurate precipitation forecasting is of great significance to social life and economic activities. Due to the influence of various factors such as topography, climate, and altitude, the precipitation in semi-arid and arid areas shows the characteristics... ver más
Revista: Water

 
Muhammad Tallal Saeed, Jahan Zeb Gul, Zareena Kausar, Asif Mahmood Mughal, Zia Mohy Ud Din and Shiyin Qin    
Precise and accurate lower limb rehabilitation in the form of locomotion assistance and gait training through robust control of robotic exoskeletons.
Revista: Applied Sciences

 
Francesca Romana Cavallo, Christofer Toumazou and Konstantin Nikolic    
The modern sedentary lifestyle is negatively influencing human health, and the current guidelines recommend at least 150 min of moderate activity per week. However, the challenge is how to measure human activity in a practical way. While accelerometers a... ver más

 
Yuanming Chen, Xiaobin Hong, Weiguo Chen, Huifang Wang and Tianhui Fan    
The new way of offshore operation and maintenance based on unmanned ships has outstanding advantages. Aiming at the problem of lack of overall understanding of the complex environment above and under the water surface during the operation and maintenance... ver más

 
Gianpietro Di Rito, Romain Kovel, Marco Nardeschi, Nicola Borgarelli and Benedetto Luciano    
The work deals with the model-based characterization of the failure transients of a fail-safe rotary EMA developed by Umbragroup (Italy) for the flap movables of the RACER helicopter-plane by Airbus Helicopters (France). Since the reference application r... ver más
Revista: Aerospace