Next Article in Journal
A Novel Fault Diagnosis Method for Analog Circuits Based on Multi-Input Deep Residual Networks with an Improved Empirical Wavelet Transform
Next Article in Special Issue
Spacecraft Telemetry Anomaly Detection Based on Parametric Causality and Double-Criteria Drift Streaming Peaks over Threshold
Previous Article in Journal
Phobia Exposure Therapy Using Virtual and Augmented Reality: A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Robust ADP-Based Sliding-Mode Fault-Tolerant Control for Nonlinear Systems with Application to Spacecraft

College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(3), 1673; https://doi.org/10.3390/app12031673
Submission received: 10 December 2021 / Revised: 29 January 2022 / Accepted: 31 January 2022 / Published: 6 February 2022

Abstract

:
This paper considers a novel fault-tolerant control (FTC) scheme for a category of cascade nonlinear systems with mismatched uncertainties and unknown actuator faults. The robust adaptive dynamic programming (RADP) is used to design a novel optimal sliding surface (SS) off-line, which renders the corresponding sliding-mode dynamics able to obtain robustness of stability to mismatched uncertainties. Subsequently, a simple sliding-mode control (SMC) with the adaptive fault compensation is developed to guarantee reachability of the sliding mode. Then, it is proven that the weight errors of neural networks (NN) of RADP and the closed-loop system are stable based on Lyapunov stability theory. In the simulation section, this proposed scheme is used to deal with the attitude FTC of a spacecraft, and simulation results verify the effectiveness of the proposed novel control scheme.

1. Introduction

In areas with high safety requirements such as spacecraft, the fault-tolerant control (FTC) has always attracted the attention [1,2] of many scholars and engineers. In [1], a novel discontinuous adaptive failure compensation controller with system chattering rejection based on a common approximate function was designed for a rigid satellite with actuator failure. In [2], an ADP-based tracking FTC with an adaptive fault compensation was given for a spacecraft attitude system with additive faults. In the FTC community, a common and very important method is the sliding-mode control (SMC) that was widely applied in industrial and academic fields due to the extremely simple design and strong anti-disturbance ability [3]. The equivalent control of SMC can cope with matched uncertainties inherently, but for mismatched uncertainties, including faults or failures, it is an open problem [4,5]. The SMC with a new sliding surface including the information of mismatched uncertainties through a disturbance observer was designed in [4]. The invariant manifold-based SS was applied to transform the multiple mismatched disturbances into matched ones in [5]. It has always been very interesting to find more and better ways to solve this open problem.
The ADP approaches have developed a relatively complete paradigm for the optimal control [6]. An online adaptive algorithm was proposed in [7], which involves the simultaneous updating of both critic and actor NN. A classical thinking [8] of transformation from robust control into optimal control was used in RADP methods [9] for adaptive control problems subject to matched or mismatched uncertainties. Specifically, in [10], ADP was introduced to integral sliding-mode control (ISMC) for systems with mismatched disturbance, but the SS design was not considered. Hence, combining SMC with ADP to cope with mismatched uncertainties induces our attentions.
The idea of using other control methods to design SS is an optional combination solution, which is an effective way to deal with nonlinear FTC problems. In [11], the classical control-based SS design combined two control algorithms. When the system states are in sliding mode, the classical control is conducted. The method [12] also adopted the above idea and used a pseudo control input in the SS design process. In [13], a novel fast terminal sliding mode control without the reaching phase was designed, which makes the derivative of an augmented state satisfy an optimal cost function. In [14], the optimal SS similar to [15] was given for a linear system. The robustness of SS to mismatched uncertainties were not considered in these two papers [13,14]. In references [16,17,18,19], optimal SSs were developed based on the first subsystem of the normal form [19]. In [16], an optimal sliding manifold was designed based on kinematics without uncertainties and the angular velocity was seen as the control input in this design process. In [17], optimal design of reaching phase and sliding phase of the SMC was given. In [18], the successive approximation approach was used to solve a novel optimal sliding mode that was designed by the same method. In [19], after an optimal equivalent control was first solved, a novel SS was derived from the above equivalent control. Recently, the ADP algorithm has also been introduced into the integral SMC design [10,20]. In [10], ADP was used to solve the optimal guaranteed cost control problem of sliding mode dynamics. In [20], ADP was employed to design the nominal control, which aimed at stabilizing sliding mode dynamics. However, in these two papers [10,20], ADP was adopted to stabilize sliding mode dynamics rather than to design SS directly.
In this paper, we intend to explore a unifying design of sliding-mode FTC with the RADP-based SS for mismatched uncertain nonlinear cascade systems with actuator faults. To this end, the RADP-based SS is developed to obtain robust sliding mode for mismatched uncertainties, and an adaptive sliding-mode FTC is designed to guarantee the reachability of above sliding mode in the presence of faults. The detailed contributions are summarized as follows.
  • The RADP-based SS is designed, which is robust to the mismatched uncertainties. An adaptive fault compensation is used to deal with the actuator faults. Compared with the traditional sliding-mode FTC [21], the novel scheme in the present paper achieves a certain sense of optimized performance by introducing the RADP-based SS. To the best knowledge of the authors, the ADP-based SS does not appear in previous reports, and the more important factor of a novel SMC tackling mismatched uncertainties based on this new SS is not reported either.
  • The idea of the ADP-based SS design method may be open a door that we can obtain many kinds of sliding modes with various properties inherited from ADP methods, such as finite-horizon optimal SS [22] or safety SS [23]. These provide more methods or possibilities for solving FTC problems.
The rest of this article is organized as follows. Section 2 gives the problem formulation. In Section 3, an adaptive sliding-mode FTC with RADP-based SS for the uncertain cascade systems is developed. The simulation examples are provided in Section 4 followed by the conclusions in Section 5.

2. Problem Formulation

In this section, the system model, the fault model, and the problem formulation are given.

2.1. System Model and Fault Model

Consider a class of the nonlinear cascade systems [24,25] encountering mismatched uncertainties and actuator faults [26,27], and it can be depicted by
x ˙ 1 ( t ) = f 1 ( x 1 ( t ) ) + g 1 ( x 1 ( t ) ) x 2 ( t ) + f ( x 1 ( t ) ) ,
x ˙ 2 ( t ) = f 2 ( x ( t ) ) + g 2 ( x ( t ) ) ( u ( t ) + f a ) ,
where x 1 ( t ) R n 1 , x 2 ( t ) R n 2 are system states, u ( t ) R m is the control input, and f i ( · ) and g i ( · ) ( i = 1 , 2 ) are differentiable in their arguments with f i ( 0 ) = 0 ( i = 1 , 2 ), f ( x 1 ) is the unknown perturbation with f ( 0 ) = 0 , f a is the actuator fault vector.
For ease of the description and the following RADP design, nominal systems of above systems (1) and (2) are given as
x ˙ 1 ( t ) = f 1 ( x 1 ( t ) ) + g 1 ( x 1 ( t ) ) x 2 ( t ) ,
x ˙ 2 ( t ) = f 2 ( x ( t ) ) + g 2 ( x ( t ) ) u ( t ) .
In order to simplify the expression, some function arguments are omitted in the description below. Then, for convenience of later analysis, the following assumptions and definition are given.
Assumption 1.
It is assumed that the control input matrix g 1 ( x 1 ) is bounded i.e., g 1 ( x 1 ) λ g , where λ g is a positive constant. The g 2 ( · ) is a full row rank matrix and the function g 2 + ( · ) is defined as the Moore–Penrose pseudo-inverse of g 2 ( · ) , i.e., g 2 + ( · ) = g 2 T ( · ) ( g 2 ( · ) g 2 T ( · ) ) 1 .
Assumption 2.
It is assumed that the uncertainty is bounded by a function λ f ( x 1 ) , i.e., f ( x 1 ) λ f ( x 1 ) with λ f ( 0 ) = 0 , and f a is the unknown constant bias actuator fault vector.
Definition 1
([7]). The equilibrium point denoted x = 0 of a dynamical system is said to be uniformly ultimately bounded (UUB) if there exists a compact set Ω R n so that for all state variable initial value x 0 Ω there exists a bound B and a time T ( B , x 0 ) such that x x 0 B for all t t 0 + T .

2.2. Control Objective

The purpose of the present paper is to design a novel SS and the corresponding FTC algorithm, which causes the following novel quadratic performance index to be minimized when the closed-loop system is reaching the SS:
J ( x 1 ( t ) ) = t Q ( x 1 ( τ ) ) + U ( x 1 ( τ ) , x 2 ( τ ) ) d τ ,
where Q is positive semi-definite matrix, which is associated with the dynamical uncertainty, and U ( x 1 , x 2 ) = x 1 T Q x 1 + x 2 T R x 2 is a utility function. The overall utility function Q ( x 1 ) + U ( x 1 , x 2 ) reflects the uncertainty, regulation, and control policy simultaneously. In particular, the state x 2 in subsystem (3) is so-called virtual control [18], which is designed based on a nominal subsystem (3).
Remark 1.
The choice of the utility function is a critical step for optimal control problems. In this paper, a robust control problem is transformed into an optimal control problem by choosing a novel utility function as described above. For an optimal control with input constraints, a popular utility function [28] is chosen as
U ( x , u ) = x ( t ) T Q x ( t ) + 2 0 u ( Φ 1 ( v ) ) T R d v ,
where Φ ( v ) = [ ϕ ( v 1 ) ϕ ( v m ) ] T , v R m , Φ R m , and ϕ ( · ) is a bounded one-to-one function that belongs to C p ( p 1 ) and L 2 ( Ω ) . The hyperbolic tangent ϕ ( · ) = t a n h ( · ) is an example. For an optimal control problem with state constraints, the utility function can be selected as
U ( x , u ) = x ( t ) T Q x ( t ) + V x + u ( t ) T R u ( t ) ,
where V x is a barrier function with respect to state x ( t ) [29]. It is not difficult to see from the above examples that the utility function is not only related to the requirements for state and control, but also directly related to the uncertainty and constraints to be dealt with.
Remark 2.
The value function (5) is designed based on the idea that the robust stabilization problem can be transformed into an optimal control of nominal systems [30].

3. Sliding Surface and Control Design

Concretely, for the system (1) and (2), a RADP-based SS is designed as
s = x 2 + 1 2 R 1 g 1 T ( x 1 ) ( σ c ( x 1 ) ) T W ^ c ,
where s is the sliding-mode variable for the mismatched uncertain cascade system, the matrix R is a penalty matrix in the utility function, the σ c is an activation function vector of critic neural network (CNN) in ADP, the σ c is gradient with respect to x 1 denoted as σ c = σ c x 1 , and W ^ c is an estimation of ideal weight vector in ADP. The following assumption is a standard assumption in ADP-related literatures, such as [2,10,31].
Assumption 3.
The ideal NN weights W c is bounded, i.e., W c λ W . The NN activation functions and their gradient are bounded, i.e., σ c ( x 1 ) λ σ , and σ c ( x 1 ) λ σ . The NN approximation error and its gradient are bounded on the compact set Ω, i.e., ε c ( x 1 ) λ ε , and ε c ( x 1 ) λ ε .

3.1. Optimal Control-Based Sliding Surface Design

In what follows, the x 2 is regarded as the control input of subsystem (3) and assumed that it is admissible. For the continuously differentiable cost function J ( x 1 ) , we can obtain the nonlinear Lyapunov equation is
0 = Q ( x 1 ) + U ( x 1 , x 2 ) + ( J ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x 2 ) ,
where J ( 0 ) = 0 . The Hamiltonian function of this optimal control problem is
H ( x 1 , x 2 , J ( x 1 ) ) = Q ( x 1 ) + U ( x 1 , x 2 ) + ( J ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x 2 ) .
Define the optimal cost function (OCF) as
J * ( x 1 ( t ) ) = min x 2 A ( Ω ) t Q ( x 1 ( τ ) ) + U ( x 1 ( τ ) , x 2 ( τ ) ) d τ ,
where A ( Ω ) is the admissible control set on the Ω , and the HJB equation of this system is given as
0 = min x 2 A ( Ω ) H ( x 1 , x 2 , J * ( x 1 ) ) .
Therefore, by differentiating (10) with respect to x 2 * , the optimal control policy is
x 2 * = 1 2 R 1 g 1 T ( x 1 ) J * ( x 1 ) .
The control policy (11) is unbounded, and if the problem of input constraints is to be studied, an alternative approach is to use a generalized non-quadratic utility function [28]. Substitute the optimal control (11) into the Equation (10) and then the HJB equation can be expressed as
0 = Q ( x 1 ) + U ( x 1 , x 2 * ) + ( J * ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x 2 * ) = Q ( x 1 ) + x 1 T Q x 1 + ( J * ( x 1 ) ) T f 1 ( x 1 ) 1 4 ( J * ( x 1 ) ) T g 1 ( x 1 ) R 1 g 1 T ( x 1 ) J * ( x 1 ) ,
which means that H ( x 1 , x 2 * , J * ( x 1 ) ) = 0 .
The design of Q ( x 1 ) is a key point in the robust stabilization scheme. In what follows, the term Q ( x 1 ) is specified as
Q ( x 1 ) = 1 8 ( J * ( x 1 ) ) T J * ( x 1 ) + 2 λ f 2 ( x 1 ) ,
and Q ( x 1 ) 0 meets. Based on the help of this form, the following lemma showing stability of the sliding-mode dynamics of the systems (1) and (2) is derived as follows.
Lemma 1.
Consider the nominal subsystem (3) and the cost function (5) with the term (13). It is assumed the HJB Equation (12) is of a solution J * ( x 1 ) , x 2 * is given by (11) and the SS designed as s = x 2 x 2 * = 0 exists. According to these conditions, on this SS, the sliding-mode dynamics of this mismatched uncertain nonlinear system described by (1) and (2) is asymptotic stability.
Proof of Lemma 1. 
According to the Theorem 1 of [30], we know that the optimal control policy (11) can ensure asymptotic stability of the uncertain nonlinear subsystem (1). On the SS, there is x 2 = x 2 * ( x 1 ) and explicitly the state x 2 is also asymptotically convergent to the equilibrium point. □

3.2. RADP-Based Sliding Surface Design

In what follows, the RADP with NN approximation is used to find this SS through obtaining a numerical solution for the above HJB equation, the analytical solution of which is rather difficult to calculate.
Based on the universal approximation property of NN, approximate J * ( x 1 ) by a NN with only one hidden layer as
J * ( x 1 ) = W c T σ c ( x 1 ) + ε c ( x 1 ) ,
where W c R p is the ideal weight, σ c ( x 1 ) R p is the activation function, p is the number of neurons, and ε c ( x 1 ) is the unknown approximation error of this NN. Next, partial derivative of the OCF is written as
J * ( x 1 ) = J * ( x 1 ) x 1 = ( σ c ( x 1 ) ) T W c + ε c ( x 1 ) .
However, the ideal NN weights are unknown, the estimation of NN weight vector W ^ c approximates the OCF.
J ^ * ( x 1 ) = W ^ c T σ c ( x 1 ) .
The partial derivative of the OCF by approximation is
J ^ * ( x 1 ) = J ^ * ( x 1 ) x 1 = ( σ c ( x 1 ) ) T W ^ c .
From (15), the optimal control can be given as
x 2 * = 1 2 R 1 g 1 T ( x 1 ) ( ( σ c ( x 1 ) ) T W c + ε c ( x 1 ) ) .
From (17), the approximate optimal control policy can be expressed as
x ^ 2 * = 1 2 R 1 g 1 T ( x 1 ) ( σ c ( x 1 ) ) T W ^ c .
For the following derivation, two non-negative matrices are denoted as follows
A ( x 1 ) = σ c ( x 1 ) g 1 ( x 1 ) R 1 g 1 T ( x 1 ) ( ( σ c ( x 1 ) ) T ,
B ( x 1 ) = σ c ( x 1 ) ( ( σ c ( x 1 ) ) T .
By using the cost function and control policy denoted by the NN, the Hamiltonian function could be given by
H ( x 1 , W c ) = x 1 T Q x 1 + W c T σ c ( x 1 ) f 1 ( x 1 ) 1 4 W c T A ( x 1 ) W c + 2 λ f 2 ( x 1 ) + 1 8 W c T B ( x 1 ) W c + e c H = 0 ,
where the term e c H is the residual error of the NN expression.
e c H = ( ε c ( x 1 ) ) T f 1 ( x 1 ) 1 2 ( ε c ( x 1 ) ) T g 1 ( x 1 ) R 1 g 1 T ( x 1 ) ( ( σ c ( x 1 ) ) T W c 1 4 ( ε c ( x 1 ) ) T g 1 ( x 1 ) R 1 g 1 T ( x 1 ) ε c ( x 1 ) + 1 4 ( ε c ( x 1 ) ) T ( σ c ( x 1 ) ) T W c + 1 8 ( ε c ( x 1 ) ) T ε c ( x 1 ) .
The approximate Hamiltonian by the estimated weight is described as
H ^ ( x 1 , W ^ c ) = x 1 T Q x 1 + W ^ c T σ c ( x 1 ) f 1 ( x 1 ) 1 4 W ^ c T A ( x 1 ) W ^ c + 2 λ f 2 ( x 1 ) + 1 8 W ^ c T B ( x 1 ) W ^ c ,
letting e c = H ^ ( x 1 , W ^ c ) H ( x 1 , W c ) , and W ˜ c = W c W ^ c . Based on (20) and (22), the e c with W ˜ c is formulated by
e c = H ^ ( x 1 , W ^ c ) H ( x 1 , W c ) = W ˜ c T σ c ( x 1 ) f 1 ( x 1 ) 1 4 W ˜ c T A ( x 1 ) W ˜ c + 1 2 W ˜ c T A ( x 1 ) W c + 1 8 W ˜ c T B ( x 1 ) W ˜ c 1 4 W ˜ c T B ( x 1 ) W c e c H .
Now, the next step is to train the CNN and obtain an update law of weights by minimizing the simple cost criterion E c = ( 1 / 2 ) e c 2 . For avoiding the difficulty of finding an admissible initial control policy, the improved weight updating rule [30] is also selected. The following general assumption [2,32] are given before the training process and the stability analysis.
Assumption 4.
Considering the nominal subsystem (3) with the cost function (5) and substituting the optimal feedback control (11) into the subsystem (3). Let J s ( x 1 ) be a continuously differentiable function and simultaneously a Lyapunov function candidate. It can be written as
J ˙ s ( x 1 ) = ( J s ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x 2 * ) < 0 .
The Γ R n 1 is a positive definite matrix ensuring that
( J s ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x 2 * ) = ( J s ( x 1 ) ) T Γ J s ( x 1 ) λ m i n ( Γ ) J s ( x 1 ) 2 ,
is true, where λ m i n ( Γ ) is the minimal eigenvalue of the matrix Γ.
For implementing this algorithm, J s ( x 1 ) can be given by some polynomials of the system states, such as J s ( x 1 ) = 0.5 x 1 T x 1 . Then, a weight update law of the CNN is constructed
W ^ ˙ c = α c ( E c W ^ c ) α s ( [ ( J s ( x 1 ) ) T ( f 1 ( x 1 ) + g 1 ( x 1 ) x ^ 2 * ) ] W ^ c ) ,
where α c > 0 is the basic update rate of the critic network, and α s > 0 is the learning rate of the additional stabilization term. Substitute the optimal control (19) into (26) and then the update law is expressed as
W ^ ˙ c = α c ( E c W ^ c ) + 1 2 α s σ c ( x 1 ) g 1 ( x 1 ) R 1 g 1 T ( x 1 ) J s ( x 1 ) .
According to the above formula, W ˜ ˙ c = W ^ ˙ c and dynamics of the weight estimation error (WEE) could be expressed as
W ˜ ˙ c = α c ( E c W ^ c ) 1 2 α s σ c ( x 1 ) g 1 ( x 1 ) × R 1 g 1 T ( x 1 ) J s ( x 1 ) ,
where
E c W ^ c = e c e c W ^ c = e c ( σ c ( x 1 ) f 1 ( x 1 ) 1 2 A ( x 1 ) W ^ c + 1 4 B ( x 1 ) W ^ c ) .
Thus, dynamics of WEE could be expressed as
W ˜ ˙ c = α c ( W ˜ c T σ c ( x 1 ) f 1 ( x 1 ) 1 4 W ˜ c T A ( x 1 ) W ˜ c + 1 2 W ˜ c T A ( x 1 ) W c + 1 8 W ˜ c T B ( x 1 ) W ˜ c 1 4 W ˜ c T B ( x 1 ) W c e c H ) × ( σ c ( x 1 ) f 1 + 1 2 A ( x 1 ) W ˜ c 1 2 A ( x 1 ) W c 1 4 B ( x 1 ) W ˜ c + 1 4 B ( x 1 ) W c ) 1 2 α s σ c ( x 1 ) g 1 ( x 1 ) × R 1 g 1 T ( x 1 ) J s ( x 1 ) .
According to this design, the following Lemma 2 shows the property of the system on the novel robust SS.
Lemma 2.
Assuming the sliding mode exists, and considering the nominal system described by (3) and (4) under the proposed RADP-based SS (6), the states of system (1) and (2) on this SS and the weight error dynamics are both UUB.
Proof of Lemma 2. 
Based on Theorem 2 of [30], we can know that state x 1 on this SS and dynamics of WEE are both UUB. For brevity, the proof process will not be repeated. Additionally, according to the SS s = 0 , state x 2 of the system on this SS is also UUB. □
Remark 3.
The use of ADP faces some problems, such as the difficulty of choosing the initial admissible control, how to choose activation functions of the neural network, and the training taking too much time. In order to avoid the impact of these problems, the off-line training method and other empirical methods are used to ensure the real-time performance of the overall control system as much as possible.

3.3. Sliding-Mode Control Design

After this novel SS is obtained through the above off-line design process, an adaptive fault-tolerant sliding-mode control is developed.
Considering the system (1) and (2) and the approximating optimal control policy of (19), the sliding-mode variable can be expressed as
s = x 2 x ^ 2 * ( x 1 ) .
Note that the control x ^ 2 * ( x 1 ) is differential according to (19), and the factory is that W ^ c is a constant vector after convergence. Therefore, the derivative of the sliding-mode variable is written as
s ˙ = x ˙ 2 x ^ ˙ 2 * ( x 1 ) = f 2 ( x ) + g 2 ( x ) ( u + f a ) x ^ ˙ 2 * ( x 1 ) .
The novel SMC design with the RADP-based SS and the adaptive fault compensation law is designed as
u = g 2 + ( x ) ( x ^ ˙ 2 * ( x 1 ) f 2 ( x ) K s ( s g n ( s ) ) ) f ^ a ,
and
f ^ ˙ a = K f g 2 T ( x ) s ,
respectively, where K s , K f > 0 are the control gain and learning rate of the adaptive law and f ^ a is estimation of actuator faults. Define the fault estimation error as f ˜ a = f a f ^ a .
Theorem 1.
Considering the mismatched uncertain and faulty system described by (1) and (2) under the proposed RADP-based sliding-mode variable (6), the control policy (32) and adaptive fault compensation law (33), the fault estimation error is bounded and the closed-loop system dynamics is UUB.
Proof of Theorem 1. 
Choose the following Lyapunov function candidate
V ( t ) = 1 2 s T s + 1 2 K f f ˜ a T f ˜ a .
The derivative of the Lyapunov function candidate (34) with respect to time along the sliding mode variable dynamics is
V ˙ ( t ) = s T s ˙ 1 K f f ˜ a T f ^ ˙ a = s T ( f 2 ( x ) + g 2 ( x ) ( u + f a ) x ^ ˙ 2 * ( x 1 ) ) 1 K f f ˜ a T f ^ ˙ a .
Substituting the control policy (32) and the actuator fault compensation law (33) into above formula (35), we obtain
V ˙ ( t ) = K s s + s T g 2 ( x ) f ˜ a 1 K f f ˜ a T f ^ ˙ a = K s s 0 .
According to the above derivation process, we could find that all signals of closed-loop system are bounded. Based on the Lyapunov theorem and Lemma 2, the sliding mode exists and the states of the whole system are uniformly ultimately bounded. From the above derivation, it can be seen that the reaching time of the SMC can be estimated by t r 2 V 1 / 2 ( 0 ) α , where t r is the reaching time, V ( 0 ) is the initial value of the Lyapunov function candidate, and α is a positive design parameter. Therefore, the corresponding time can be reduced by adjusting the parameter α . The proof ends here. □
Novelty and comparison: How to make the designed sliding surface (SS) insensitive to the mismatched uncertainties is crucial for the SMC design and the SMC-based FTC.
In [4], a mismatched disturbance is estimated by a disturbance observer, and then the obtained estimation signal of this disturbance is used to design the SS. The disturbance observer makes the system more complex, which increases the difficulty of controller design. However, our designed SS can directly deal with this problem by the optimal control method without designing any estimator. Even though finding the solution of optimal control is tricky, we can solve it directly using the ADP method off-line.
In [10], the integral SMC (ISMC) law is divided into two parts, one is a continuous part, and the other is a discontinuous part. The continuous part of control is used to reject the mismatched uncertainties and consequently the SS is insensitive to the mismatched uncertainties. In this process, the complex zero-sum games and online complex neural network training are used in the continuous part of control, which are not conducive to practical applications and the realization of engineers. Our method is simpler in the design process and implemented off-line.
In the papers [13,14], although they all use the optimal control to design the SS, they cannot solve the problem that is making the SS insensitive to the mismatched uncertainties in the SS design process.

4. Simulation Examples

In this part, the novel sliding-mode FTC with the RADP-based SS is applied in the mismatched uncertain spacecraft attitude control systems (ACS), which are the benchmark nonlinear cascade systems. The model of this ACS [2,33] is presented as
ϑ ˙ ψ ˙ γ ˙ = f ( x 1 ) ω + f ( x 1 ) ,
and
J ω ˙ = ω × J ω + u ( t ) + f a ,
where the inertia matrix J and f ( x 1 ) are characterized by
J = 20 0 0.9 0 17 0 0.9 0 15 kg · m 2 ,
f ( x 1 ) = 0 sin γ cos γ 0 cos γ / cos ϑ sin γ / cos ϑ 1 tan ϑ cos γ tan ϑ sin γ ) ,
the mismatched uncertainties f ( x 1 ) and the actuator faults f a are given as
f ( x 1 ) = 0.1 ϑ s i n ( ψ ) c o s ( γ ) 0.1 γ c o s ( ψ 2 ) 0.1 ψ s i n ( ϑ ) ,
f a = [ 0 , 0 , 0 ] T , t < 50 s [ 0 , 0.5 , 0.8 ] T , t 50 s
furthermore, ϑ , ψ , and γ are pitch angle, yaw angle, and roll angle of a spacecraft, simply denoted as a vector x 1 = [ ϑ , ψ , γ ] T , and ω x , ω y , and ω z are angular velocities with respect to an inertial reference frame expressed in the body-fixed frame. In addition, the uncertainty f ( x 1 ) is bounded, i.e., f ( x 1 ) x 1 . The initial value of states of this ACS are x 1 ( 0 ) = [ 0.5 , 0.5 , 0.5 ] T r a d , and ω ( 0 ) = [ 0.1 , 0.1 , 0.1 ] T r a d / s .

4.1. Traditional SMC-Based Fault-Tolerant Controller

To compare with the proposed method in this paper, a traditional SMC-based FTC is designed as follows
u = ( f ( x 1 ) J 1 ) 1 ( f ˙ ( x 1 ) ω + f ( x 1 ) J 1 ω × J ω c f ( x 1 ) ω f ( x 1 ) J 1 f ^ a K s s a t ( s t ) ,
and the adaptive fault compensation is
f ^ ˙ a = K f J T f ( x 1 ) T s t ,
where the designed sliding surface is s t = f ( x 1 ) ω + c x 1 with c = 0.1 and let K s = 2 and K f = 100 . The simulation results are given as follows, and comments are given in the next subsection in a comparative manner.

4.2. RADP-Based Sliding-Mode Fault-Tolerant Controller

The total utility function is chosen as Q ( x 1 ) + U ( x 1 , ω ) = Q ( x 1 ) + x 1 T Q x 1 + ω T R ω with Q = I 3 and R = I 3 for the uncertain ACS. A CNN is constructed to approximate the OCF as
J ^ n * ( x 1 ) = W ^ c 1 ϑ 2 + W ^ c 2 ψ 2 + W ^ c 3 γ 2 + W ^ c 4 ϑ ψ + W ^ c 5 ϑ γ + W ^ c 6 ϑ ψ γ .
Note that σ c ( a ) = [ ϑ 2 , ψ 2 , γ 2 , ϑ ψ , ϑ γ , ϑ ψ γ ] T and W ^ c = [ W ^ c 1 , W ^ c 2 , W ^ c 3 , W ^ c 4 , W ^ c 5 , W ^ c 6 ] T are the activation function and the estimation of the ideal weight of NN, respectively. In these simulations, we set α c = 0.1 and σ s = 0.5 . Substitute the constant weight W ^ c after convergence into the novel SS and let K s = 2 and K f = 100 .
Via Figure 1, Figure 2 and Figure 3, it can be found that, under the basically same parameter settings, the proposed control scheme posses better convergence characteristics and less adjustment time and, through the boundary layer technology, no chattering phenomena. Observe the above simulation results of the proposed novel control scheme and it can be found that the states of the systems are asymptotic stability rather than UUB in Theorem 1. The reason for this behaviour is that g 2 ( x ) = J is invertible constant matrix and this makes the error 1 2 R 1 g 1 T ( x 1 ) ε c ( x 1 ) in (18) matched with g 2 ( x ) . Therefore, if K s is large enough, the SMC (32) can guarantee its robustness against this error. However, in the SS design process of traditional SMC-based FTC, it is not considered to deal with the mismatched uncertainties, so that system states cannot eventually converge to zero, even if the SMC guarantees reachability of the SS. The evolution of the weight W ^ c is shown in Figure 4, and finally the weight vector eventually converges to [ 0.1370 , 0.1467 , 0.1326 , 0.1307 , 0.1204 , 0.0786 ] T .

4.3. RADP-Based Sliding-Mode Fault-Tolerant Controller in Presence of Input Delay

Factors that cause delays are numerous in the ACS, such as the response time of actuators and sensors, faults of actuators and sensors, signal sampling and transmission, etc. The effects will be reflected in the input delay [34]. The existence of the input delay will cause the degradation of control performance and even destroy the stability of the system. To verify whether the proposed controller can cope with the time delay uncertainty, in the above simulation example, an unknown constant time delay is added into the original control system:
J ω ˙ = ω × J ω + u ( t τ ) + f a ,
where the time delay is set as τ = 0.1 and other parameters are the same as the values set above. The simulation results are shown as Figure 5, Figure 6 and Figure 7.
By comparing the simulation results before and after adding the input delay, it can be found that the proposed control scheme can make the time delay system bounded, but the control performance has dropped. If a better control performance is pursued, special design for time delay is essential.

5. Conclusions

In the present paper, a novel SMC based FTC scheme with RADP-based SS for a category of uncertain cascade systems is proposed. In particular, a novel idea for the ADP-based SS design is proposed to cope with the mismatched uncertainties in the SMC theory and application. For the practical implementing, the off-line design of RADP-based SS avoids the time-consuming training of neural network weights, and the adaptive SMC is also helpful to implement. In addition, the simulation part demonstrates a practical example that is the attitude control system of the spacecraft. Note that the uncertainty is also mismatched for the subsystem in this paper. Furthermore, matched uncertainties including certain faults for the subsystem are worth researching in the future. In addition, more new sliding surfaces based on the ADP algorithm treasure house could be designed, which can be combined with the powerful SMC mechanism to solve more difficult FTC problems.

Author Contributions

Y.D. provided methodology, validation, and writing—original draft preparation; B.J., Y.M. and Y.C. provided conceptualization, writing—review & editing, and supervision; and B.J. provided funding support. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under Grant 62020106003, Grant 61922042 and Grant 61773201, the Fundamental Research Funds for the Central Universities of China under Grant NC2020002, and Grant NP2020103, the 111 Project of the Programme of Introducing Talents of Discipline to Universities of China under Grant B20007, and the 13th Five-Year Equipment Pre-research Projects of China under Grant 30501050403.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ma, Y.; Jiang, B.; Tao, G.; Cheng, Y. Actuator failure compensation and attitude control for rigid satellite by adaptive control using quaternion feedback. J. Frankl. Inst. 2014, 351, 296–314. [Google Scholar] [CrossRef]
  2. Du, Y.; Jiang, B.; Ma, Y. Policy Iteration Based Online Adaptive Optimal Fault Compensation Control for Spacecraft. Int. J. Control Autom. Syst. 2021, 19, 1607–1617. [Google Scholar] [CrossRef]
  3. Utkin, V.I. Sliding Modes in Control and Optimization; Springer: Berlin/Heidelberg, Germany, 1992. [Google Scholar]
  4. Yang, J.; Li, S.; Yu, X. Sliding-mode control for systems with mismatched uncertainties via a disturbance observer. IEEE Trans. Ind. Electron. 2012, 60, 160–169. [Google Scholar] [CrossRef]
  5. Zhang, L.; Yang, J.; Li, S.; Yu, X. Invariant manifold based output-feedback sliding mode control for systems with mismatched disturbances. IEEE Trans. Circuits Syst. II-Express Briefs 2020, 68, 933–937. [Google Scholar] [CrossRef]
  6. Zhang, H.; Zhang, X.; Luo, Y.; Yang, J. An overview of research on adaptive dynamic programming. Acta Autom. Sin. 2013, 39, 303–311. [Google Scholar] [CrossRef]
  7. Vamvoudakis, K.G.; Lewis, F.L. Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 2010, 46, 878–888. [Google Scholar] [CrossRef]
  8. Lin, F.; Brandt, R.D.; Sun, J. Robust control of nonlinear systems: Compensating for uncertainty. Int. J. Control 1992, 56, 1453–1459. [Google Scholar] [CrossRef]
  9. Wang, D.; Liu, D.; Li, H.; Ma, H. Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming. Inf. Sci. 2014, 282, 167–179. [Google Scholar] [CrossRef]
  10. Zhang, H.; Qu, Q.; Xiao, G.; Cui, Y. Optimal guaranteed cost sliding mode control for constrained-input nonlinear systems with matched and unmatched disturbances. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 2112–2126. [Google Scholar] [CrossRef]
  11. Yeung, K.S.; Cheng, C.C.; Kwan, C.M. A unifying design of sliding mode and classical controllers. IEEE Trans. Autom. Control 1993, 38, 1422–1427. [Google Scholar] [CrossRef]
  12. Wen, C.C.; Cheng, C.C. Design of sliding surface for mismatched uncertain systems to achieve asymptotical stability. J. Franklin Inst. 2008, 345, 926–941. [Google Scholar] [CrossRef]
  13. Saeedi, M.; Zarei, J.; Razavi-Far, R.; Saif, M. Event-Triggered Adaptive Optimal Fast Terminal Sliding Mode Control Under Denial-of-Service Attacks. IEEE Syst. J. 2021. [Google Scholar] [CrossRef]
  14. Perrusquía, A.; Flores-Campos, J.A.; Yu, W. Optimal sliding mode control for cutting tasks of quick-return mechanisms. ISA Trans. 2021. [Google Scholar] [CrossRef] [PubMed]
  15. Shtessel, Y.; Edwards, C.; Fridman, L.; Levant, A. Sliding Mode Control and Observation; Springer: New York, NY, USA, 2014; Volume 10. [Google Scholar]
  16. Crassidis, J.L.; Vadali, S.R.; Markley, F.L. Optimal variable-structure control tracking of spacecraft maneuvers. J. Guid. Control Dyn. 2000, 23, 564–566. [Google Scholar] [CrossRef]
  17. Xu, J.X.; Zhang, J. On quasi-optimal variable structure control approaches. In Variable Structure Systems: Towards the 21st Century; Springer: Berlin/Heidelberg, Germany, 2002; pp. 175–200. [Google Scholar]
  18. Tang, G.Y.; Dong, R.; Gao, H.W. Optimal sliding mode control for nonlinear systems with time-delay. Nonlinear Anal.-Hybrid Syst. 2008, 2, 891–899. [Google Scholar] [CrossRef]
  19. Janardhanan, S.; Kariwala, V. Multirate-output-feedback-based LQ-optimal discrete-time sliding mode control. IEEE Trans. Autom. Control 2008, 53, 367–373. [Google Scholar] [CrossRef]
  20. Fan, Q.Y.; Yang, G.H. Adaptive actor–critic design-based integral sliding-mode control for partially unknown nonlinear systems with input disturbances. IEEE Trans. Neural Netw. Learn. Syst. 2015, 27, 165–177. [Google Scholar] [CrossRef]
  21. Shen, Q.; Wang, D.; Zhu, S.; Poh, E.K. Integral-type sliding mode fault-tolerant control for attitude stabilization of spacecraft. IEEE Trans. Control Syst. Technol. 2014, 23, 1131–1138. [Google Scholar] [CrossRef]
  22. Cheng, T.; Lewis, F.L.; Abu-Khalaf, M. Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach. IEEE Trans. Neural Netw. 2007, 18, 1725–1737. [Google Scholar] [CrossRef]
  23. Yang, Y.; Vamvoudakis, K.G.; Modares, H. Safe reinforcement learning for dynamical games. Int. J. Robust Nonlinear Control 2020, 30, 3706–3726. [Google Scholar] [CrossRef]
  24. Krstic, M.; Kokotovic, P.V.; Kanellakopoulos, I. Nonlinear and Adaptive Control Design; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1995. [Google Scholar]
  25. Wang, Z.; Liu, X.; Liu, K.; Li, S.; Wang, H. Backstepping-based Lyapunov function construction using approximate dynamic programming and sum of square techniques. IEEE T. Cybern. 2016, 47, 3393–3403. [Google Scholar] [CrossRef] [PubMed]
  26. Jiang, B.; Staroswiecki, M.; Cocquempot, V. Fault accommodation for nonlinear dynamic systems. IEEE Trans. Autom. Control 2006, 51, 1578–1583. [Google Scholar] [CrossRef]
  27. Zhang, K.; Jiang, B.; Cocquempot, V. Adaptive observer-based fast fault estimation. Int. J. Control Autom. Syst. 2008, 6, 320–326. [Google Scholar]
  28. Abu-Khalaf, M.; Lewis, F.L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005, 41, 779–791. [Google Scholar] [CrossRef]
  29. Dong, H.; Zhao, X.; Yang, H. Reinforcement learning-based approximate optimal control for attitude reorientation under state constraints. IEEE Trans. Control Syst. Technol. 2021, 29, 1664–1673. [Google Scholar] [CrossRef]
  30. Wang, D.; Liu, D.; Mu, C.; Zhang, Y. Neural network learning and robust stabilization of nonlinear systems with dynamic uncertainties. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 1342–1351. [Google Scholar] [CrossRef] [PubMed]
  31. Lewis, F.; Jagannathan, S.; Yesildirak, A. Neural Network Control of Robot Manipulators and Non-Linear Systems; Taylor & Francis: London, UK, 1999. [Google Scholar]
  32. Wang, D.; Liu, D.; Zhang, Y.; Li, H. Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems. Neural Netw. 2018, 97, 11–18. [Google Scholar] [CrossRef]
  33. Ma, Y.; Jiang, B.; Tao, G.; Yang, H. Adaptive direct compensation control scheme for spacecraft with multiple actuator faults. J. Guid. Control Dyn. 2019, 42, 923–930. [Google Scholar] [CrossRef]
  34. Liu, C.; Vukovich, G.; Sun, Z.; Shi, K. Observer-based fault-tolerant attitude control for spacecraft with input delay. J. Guid. Control Dyn. 2018, 41, 2041–2053. [Google Scholar] [CrossRef]
Figure 1. Evolution of attitude of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Figure 1. Evolution of attitude of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Applsci 12 01673 g001
Figure 2. Evolution of angular velocity of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Figure 2. Evolution of angular velocity of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Applsci 12 01673 g002
Figure 3. Evolution of control inputs of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Figure 3. Evolution of control inputs of the ACS under different control schemes in the presence of mismatched uncertainties and actuator faults. (a) Traditional SMC-based FTC. (b) The proposed FTC with RADP-based sliding surface.
Applsci 12 01673 g003
Figure 4. Evolution of the critic neural network weights.
Figure 4. Evolution of the critic neural network weights.
Applsci 12 01673 g004
Figure 5. Comparison of attitude of ACS without and with input delay. (a) Evolution of attitude of the ACS. (b) Evolution of attitude with input delay.
Figure 5. Comparison of attitude of ACS without and with input delay. (a) Evolution of attitude of the ACS. (b) Evolution of attitude with input delay.
Applsci 12 01673 g005
Figure 6. Comparison of angular velocity of ACS without and with input delay. (a) Evolution of angular velocity of the ACS. (b) Evolution of attitude with input delay.
Figure 6. Comparison of angular velocity of ACS without and with input delay. (a) Evolution of angular velocity of the ACS. (b) Evolution of attitude with input delay.
Applsci 12 01673 g006
Figure 7. Comparison of inputs of ACS without and with input delay. (a) Evolution of control input of the ACS. (b) Evolution of control input with input delay.
Figure 7. Comparison of inputs of ACS without and with input delay. (a) Evolution of control input of the ACS. (b) Evolution of control input with input delay.
Applsci 12 01673 g007
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Du, Y.; Jiang, B.; Ma, Y.; Cheng, Y. Robust ADP-Based Sliding-Mode Fault-Tolerant Control for Nonlinear Systems with Application to Spacecraft. Appl. Sci. 2022, 12, 1673. https://doi.org/10.3390/app12031673

AMA Style

Du Y, Jiang B, Ma Y, Cheng Y. Robust ADP-Based Sliding-Mode Fault-Tolerant Control for Nonlinear Systems with Application to Spacecraft. Applied Sciences. 2022; 12(3):1673. https://doi.org/10.3390/app12031673

Chicago/Turabian Style

Du, Yanbin, Bin Jiang, Yajie Ma, and Yuehua Cheng. 2022. "Robust ADP-Based Sliding-Mode Fault-Tolerant Control for Nonlinear Systems with Application to Spacecraft" Applied Sciences 12, no. 3: 1673. https://doi.org/10.3390/app12031673

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop