Southern University of Science and Technology Knowledge Commons (SUSTech KC): Safety-Aware Optimal Control of Nonlinear Systems Using Off-Policy Reinforcement Learning

Title	Safety-Aware Optimal Control of Nonlinear Systems Using Off-Policy Reinforcement Learning
Authors	Mingduo Lin 1; Bo Zhao 1; Hongbing Xia 2; Derong Liu 3
DOI	10.1109/CSIS-IAC60628.2023.10363836
Publication date	2023
ISBN	979-8-3503-0901-0
Proceedings title	2023 International Annual Conference on Complex Systems and Intelligent Science (CSIS-IAC)
Pages	75-79
Conference date	20-22 Oct. 2023
Conference location	Shenzhen, China
Abstract	In this paper, we investigate the safety-aware optimal control (SAOC) problem, which seeks to minimize a predefined performance index function while ensuring the safety of nonlinear systems. First, a barrier-function-based system transformation is used to design an optimal control policy that keeps the system states within the safety region. To handle input constraints, a non-quadratic cost function is imposed on the control input. Then, the Hamilton-Jacobi-Bellman equation is established to characterize the solution of the SAOC problem. Moreover, using the off-policy Bellman equation, a data-based off-policy reinforcement learning (OPRL) algorithm is developed to obtain the safety-aware optimal controller in a model-free manner. To implement this algorithm, a data collection process with the barrier transform is executed to generate the off-policy trajectory data, and an actor-critic neural network structure with a least-squares updating law is employed in the off-policy learning phase. Finally, a simulation example is provided to verify the effectiveness of the developed control method.
Keywords	Safety-aware control ; reinforcement learning ; adaptive dynamic programming ; off-policy
University attribution	Other
Related links	[IEEE record]
Indexed in	EI
EI accession number	20240415419451
EI controlled terms	Adaptive control systems ; Cost functions ; Dynamic programming ; Learning algorithms ; Learning systems ; Nonlinear systems
EI classification codes	Artificial Intelligence:723.4 ; Machine Learning:723.4.2 ; Control Systems:731.1 ; Optimization Techniques:921.5 ; Systems Science:961
Source database	IEEE
Full-text link	https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10363836
Output type	Conference paper
Item identifier	http://sustech.caswiz.com/handle/2SGJ60CL/673736
Collection	College of Engineering_School of System Design and Intelligent Manufacturing
Author affiliations	1. School of Systems Science, Beijing Normal University, Beijing, China; 2. School of Artificial Intelligence, Anhui University, Hefei, China; 3. School of System Design and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen, China
Recommended citation (GB/T 7714)	Mingduo Lin,Bo Zhao,Hongbing Xia,et al. Safety-Aware Optimal Control of Nonlinear Systems Using Off-Policy Reinforcement Learning[C],2023:75-79.