Learning Output Reference Model Tracking for Higher-Order Nonlinear Systems with Unknown Dynamics

Mircea-Bogdan Radac and Timotei Lala

Resumen

This work suggests a solution for the output reference model (ORM) tracking control problem, based on approximate dynamic programming. General nonlinear systems are included in a control system (CS) and subjected to state feedback. By linear ORM selection, indirect CS feedback linearization is obtained, leading to favorable linear behavior of the CS. The Value Iteration (VI) algorithm ensures model-free nonlinear state feedback controller learning, without relying on the process dynamics. From linear to nonlinear parameterizations, a reliable approximate VI implementation in continuous state-action spaces depends on several key parameters such as problem dimension, exploration of the state-action space, the state-transitions dataset size, and a suitable selection of the function approximators. Herein, we find that, given a transition sample dataset and a general linear parameterization of the Q-function, the ORM tracking performance obtained with an approximate VI scheme can reach the performance level of a more general implementation using neural networks (NNs). Although the NN-based implementation takes more time to learn due to its higher complexity (more parameters), it is less sensitive to exploration settings, number of transition samples, and to the selected hyper-parameters, hence it is recommending as the de facto practical implementation. Contributions of this work include the following: VI convergence is guaranteed under general function approximators; a case study for a low-order linear system in order to generalize the more complex ORM tracking validation on a real-world nonlinear multivariable aerodynamic process; comparisons with an offline deep deterministic policy gradient solution; implementation details and further discussions on the obtained results.

Palabras claves

approximate dynamic programming - reinforcement learning - data-driven control - model-free control - reference trajectory tracking - output reference model - multivariable control - aerodynamic rotor system - neural networks - learning systems

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 12 Parte: 6 (2019)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Aerospace
Applied Sciences
Algorithms

DOI

https://doi.org/10.3390/a12060121

Art�culos similares

ECARRNet: An Efficient LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection

Acceso

Salman Ibne Eunus, Shahriar Hossain, A. E. M. Ridwan, Ashik Adnan, Md. Saiful Islam, Dewan Ziaul Karim, Golam Rabiul Alam and Jia Uddin

Accidents due to defective railway lines and derailments are common disasters that are observed frequently in Southeast Asian countries. It is imperative to run proper diagnosis over the detection of such faults to prevent such accidents. However, manual... ver m�s

Revista: AI

GCN?Informer: A Novel Framework for Mid-Term Photovoltaic Power Forecasting

Acceso

Wei Zhuang, Zhiheng Li, Ying Wang, Qingyu Xi and Min Xia

Predicting photovoltaic (PV) power generation is a crucial task in the field of clean energy. Achieving high-accuracy PV power prediction requires addressing two challenges in current deep learning methods: (1) In photovoltaic power generation prediction... ver m�s

Revista: Applied Sciences

Interactive Teaching in Virtual Environments: Integrating Hardware in the Loop in a Brewing Process

Acceso

Jessica S. Ortiz, Richard S. Pila, Joel A. Yupangui and Marco M. Rosales

The teaching?learning process developed was based on the effective integration of the Hardware in the Loop (HIL) technique to control a brewing process. This required programming the autonomous control of the system and uploading it to a physical control... ver m�s

Revista: Applied Sciences

Delving into Causal Discovery in Health-Related Quality of Life Questionnaires

Acceso

Maria Ganopoulou, Efstratios Kontopoulos, Konstantinos Fokianos, Dimitris Koparanis, Lefteris Angelis, Ioannis Kotsianidis and Theodoros Moysiadis

Questionnaires on health-related quality of life (HRQoL) play a crucial role in managing patients by revealing insights into physical, psychological, lifestyle, and social factors affecting well-being. A methodological aspect that has not been adequately... ver m�s

Revista: Algorithms

Sea-Surface Small Target Detection Based on Improved Markov Transition Fields

Acceso

Ru Ye, Hongyan Xing and Xing Zhou

Addressing the limitations of manually extracting features from small maritime target signals, this paper explores Markov transition fields and convolutional neural networks, proposing a detection method for small targets based on an improved Markov tran... ver m�s

Revista: Journal of Marine Science and Engineering

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas