题名 | Local Sensing based Multi-agent Pursuit-evasion with Deep Reinforcement Learning |
作者 | |
DOI | |
发表日期 | 2022
|
ISSN | 2688-092X
|
ISBN | 978-1-6654-6534-2
|
会议录名称 | |
页码 | 6748-6752
|
会议日期 | 25-27 Nov. 2022
|
会议地点 | Xiamen, China
|
摘要 | In this paper, a problem of multi-agent pursuit-evasion in which multiple pursuers try to round up a single evader as soon as possible in a 2D limited space is considered. A cooperative approach of pursuit strategy through sensing among pursuits is developed under the multi-agent deep deterministic policy gradient (MADDPG) reinforcement learning (RL) method, which adopts the centralized training and distributed execution to deal with the pursuit problem by a fully distributed approach. Instead of using the communication information, each pursuer is supposed that can only obtain the sensing information of its neighbors and the evader. By introducing the sensing range segment and relative-position average strategy, we allow that the number of the pursuer can be changeable, which is different from some existing results that assume that the number of pursuers is fixed. To demonstrate the feasibility of the proposed method above, we implement it in a simulation environment. Simulation results show that the pursuer can learn a highly cooperative control strategy and capture the evader with a high success rate. |
关键词 | |
学校署名 | 其他
|
相关链接 | [IEEE记录] |
来源库 | IEEE
|
全文链接 | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10055841 |
引用统计 |
被引频次[WOS]:0
|
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/502104 |
专题 | 工学院_电子与电气工程系 |
作者单位 | 1.School of Automation (Artifical Intellengence), Hangzhou Dianzi University, Hangzhou, China 2.Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China |
推荐引用方式 GB/T 7714 |
Shaoxin Wang,Bo Wang,Zhimin Han,et al. Local Sensing based Multi-agent Pursuit-evasion with Deep Reinforcement Learning[C],2022:6748-6752.
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论