Title

Spatial-Temporal Pyramid Graph Reasoning for Action Recognition

Authors
Tiantian Geng; Feng Zheng; Xiaorong Hou; Ke Lu; Guo-Jun Qi; Ling Shao
Publication date
2022
DOI
Journal
IEEE Transactions on Image Processing
ISSN
1941-0042
EISSN
1941-0042
Volume: PP; Issue: 99; Pages: 1-1
Abstract
Spatial-temporal relation reasoning is a significant yet challenging problem for video action recognition. Previous works typically apply local operations like 2D or 3D CNNs to conduct space-time interactions in video sequences, or simply capture space-time long-range relations of a single fixed scale. However, this is inadequate for obtaining a comprehensive action representation. Besides, most models treat all input frames equally for the final classification, without selecting key frames and motion-sensitive regions. This introduces irrelevant video content and hurts the performance of models. In this paper, we propose a generic Spatial-Temporal Pyramid Graph Network (STPG-Net) to adaptively capture long-range spatial-temporal relations in video sequences at multiple scales. Specifically, we design a temporal attention (TA) module and a spatial-temporal attention (STA) module to learn the contribution of each frame and each space-time region to an action at a feature level, respectively. We then apply the selected key information to build spatial-temporal pyramid graphs for long-range relation reasoning and more comprehensive action representation learning. STPG-Net can be flexibly integrated into 2D and 3D backbone networks in a plug-and-play manner. Extensive experiments show that it brings consistent improvements over many challenging baselines on several standard action recognition benchmarks (i.e., Something-Something V1 & V2, and FineGym), demonstrating the effectiveness of our approach.
Keywords
Related links: [IEEE record]
Indexed by
SCI ; EI
Language
English
University attribution
Other
Funding
National Natural Science Foundation of China (grants 61972188, 62122035)
WOS research areas
Computer Science ; Engineering
WOS categories
Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS accession number
WOS:000844128200001
Publisher
EI accession number
20223412602333
EI main headings
Graphic methods ; Three dimensional displays ; Video recording
EI classification codes
Television Systems and Equipment:716.4 ; Computer Peripheral Equipment:722.2
ESI research field
ENGINEERING
Source database
Web of Science
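Full-text link: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9852978
Citation statistics
Times cited [WOS]: 8
Document type: Journal article
Item identifier: http://sustech.caswiz.com/handle/2SGJ60CL/375591
Collections: College of Engineering / Sifakis Research Institute of Trustworthy Autonomous Systems
College of Engineering / Department of Computer Science and Engineering
Author affiliations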
1.School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, China
2.Department of Computer Science and Engineering and the Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, Shenzhen, China
3.Peng Cheng Laboratory, Shenzhen, China
4.Futurewei Technologies, Seattle, WA, USA
5.Terminus Group, China
Recommended citation
GB/T 7714
Tiantian Geng, Feng Zheng, Xiaorong Hou, et al. Spatial-Temporal Pyramid Graph Reasoning for Action Recognition[J]. IEEE Transactions on Image Processing, 2022, PP(99): 1-1.
APA
Tiantian Geng, Feng Zheng, Xiaorong Hou, Ke Lu, Guo-Jun Qi, & Ling Shao. (2022). Spatial-Temporal Pyramid Graph Reasoning for Action Recognition. IEEE Transactions on Image Processing, PP(99), 1-1.
MLA
Tiantian Geng, et al. "Spatial-Temporal Pyramid Graph Reasoning for Action Recognition." IEEE Transactions on Image Processing PP.99 (2022): 1-1.