题名 | Safety-Aware Optimal Control of Nonlinear Systems Using Off-Policy Reinforcement Learning* |
作者 | |
DOI | |
发表日期 | 2023
|
ISBN | 979-8-3503-0901-0
|
会议录名称 | |
页码 | 75-79
|
会议日期 | 20-22 Oct. 2023
|
会议地点 | Shenzhen, China
|
摘要 | In this paper, we investigate the safety-aware optimal control (SAOC) problem, which attempts to minimize a predefined performance index function while ensuring the safety of nonlinear systems. First, the barrier function-based system transformation is utilized to design an optimal control policy which maintains the system states located in the safety region. To deal with the input constraints, a non-quadratic cost function is imposed to the control input. Then, the Hamilton-Jacobi-Bellman equation is established to provide the solution of the SAOC problem. Moreover, by utilizing the off-policy Bellman equation, a data-based off-policy reinforcement learning (OPRL) algorithm is developed to obtain the safety-aware optimal controller in a model-free manner. To implement this algorithm, a data collection process with the barrier transform is executed to generate the off-policy trajectory data, and an actor-critic neural network structure with the least-square updating law is employed in the off-policy learning phase. Finally, a simulation example is provided to verify the effectiveness of the developed control method. |
关键词 | |
学校署名 | 其他
|
相关链接 | [IEEE记录] |
收录类别 | |
EI入藏号 | 20240415419451
|
EI主题词 | Adaptive control systems
; Cost functions
; Dynamic programming
; Learning algorithms
; Learning systems
; Nonlinear systems
|
EI分类号 | Artificial Intelligence:723.4
; Machine Learning:723.4.2
; Control Systems:731.1
; Optimization Techniques:921.5
; Systems Science:961
|
来源库 | IEEE
|
全文链接 | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10363836 |
引用统计 | |
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/673736 |
专题 | 工学院_系统设计与智能制造学院 |
作者单位 | 1.School of Systems Science, Beijing Normal University, Beijing, China 2.School of Artificial Intelligence, Anhui University, Hefei, China 3.School of System Design and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen, China |
推荐引用方式 GB/T 7714 |
Mingduo Lin,Bo Zhao,Hongbing Xia,et al. Safety-Aware Optimal Control of Nonlinear Systems Using Off-Policy Reinforcement Learning*[C],2023:75-79.
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论