2023 AIChE Annual Meeting
(59ac) Control Lyapunov-Barrier Function-Based Safe Reinforcement Learning for Nonlinear Optimal Control
This work presents a novel safe reinforcement learning method to solve the safe optimal control problem for nonlinear systems with input constraints. Specifically, we design a new performance index function such that the value function is a control Lyapunov-barrier function (CLBF) with inherent stability and safety properties [6], and therefore, lead to an optimal control policy with guaranteed stability, safety, and optimality simultaneously. Since it is challenging to obtain the closed form of value functions, we use neural networks (NN) to approximate the CLBF-based value function in the policy iteration algorithm, for which the process operation data that indicate safe and unsafe operations is used to develop an NN-based barrier function. The theoretical results on stability, safety, and optimality of SRL method is developed by accounting for the generalization error of NN-based value function. Finally, an application to a chemical process example is presented to show the effectiveness of the proposed control strategy.
References:
- Nian, J. Liu, and B. Huang, âA review on reinforcement learning: Introduction and applications in industrial process control,â Computers and Chemical Engineering, vol. 139, p. 106886, 2020
- D. Ames, X. Xu, J. W. Grizzle, and P. Tabuada, âControl barrier function based quadratic programs for safety critical systems,â IEEE Transactions on Automatic Control, vol. 62, no. 8, pp. 3861â3876, 2016
- Zanon and S. Gros, âSafe reinforcement learning using robust MPC,â IEEE Transactions on Automatic Control, vol. 66, no. 8, pp. 3638â3652, 2021.
- H. Cohen and C. Belta, âSafe exploration in model-based reinforcement learning using control barrier functions,â Automatica, vol. 147, p. 110684, 2023.
- Marvi and B. Kiumarsi, âSafe reinforcement learning: A control barrier function optimization approach,â International Journal of Robust and Nonlinear Control, vol. 31, no. 6, pp. 1923â1940, 2021.
- Wu, F. Albalawi, Z. Zhang, J. Zhang, H. Durand and P. D. Christofides, âControl Lyapunov-Barrier Function-Based Model Predictive Control of Nonlinear Systems,â Automatica, 109, 108508, 2019.