中文版 | English
题名

Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection

作者
发表日期
2022
DOI
发表期刊
ISSN
1051-8215
EISSN
1558-2205
卷号33期号:6页码:1-1
摘要
The misalignment between classification and localization is a significant performance improvement point for object detection. To cope with the misalignment problem, more attempts have been made to separate different tasks (e.g., Classification, Bounding Box Regression) by introducing extra heads, which emphasizes the separation of multiple tasks to cope with their variability. In this paper, we consider that both separation and crosstalk are important between classification and localization. Considering that the two types of tasks are different and have different regions and features of interest, they are in conflict with each other and therefore need to be separated. However, they also need to be fused, because classification and localization are, after all, about understanding the same object. To realize this idea, we introduce bidirectional crosstalk detection head in a systematic manner to provide a full deep cross-fusion between classification and localization. To our best knowledge, it is the first time that full bidirectional crosstalk is introduced between classification and localization for one-stage detector. Extensive experiments are conducted to demonstrate the effectiveness of the proposed method. With a ResNet-50 backbone, our method can significantly improve the GFLV1 baseline by 2.0 AP with similar inference speed (18.5 fps vs. 18.3 fps) and further boost GFLV1 with a big margin (4.3 AP) by increasing our model size. Fair comparisons also show that the proposed head outperforms state-of-the-art heads (T-Head, DyHead) with comparable or faster inference speed under the same ATSS baseline model. With a Res2Net-DCN backbone, our model achieves 51.7 AP at single-model single-scale testing. The code and pretrained models will be made publicly available.
关键词
相关链接[Scopus记录]
收录类别
EI ; SCI
语种
英语
学校署名
其他
EI入藏号
20224613110664
EI主题词
Alignment ; Feature extraction ; Object detection ; Object recognition ; Separation
EI分类号
Mechanical Devices:601.1 ; Data Processing and Image Processing:723.2 ; Chemical Operations:802.3
ESI学科分类
ENGINEERING
Scopus记录号
2-s2.0-85141601655
来源库
Scopus
全文链接https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9934934
引用统计
被引频次[WOS]:4
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/411893
专题南方科技大学
作者单位
1.School of Computer Science and Technology, National University of Defense Technology, Changsha, China
2.Space Engineering University, Beijing, China
3.South University of Science and Technology and Harbin Institute of Technology, Shenzhen, China
推荐引用方式
GB/T 7714
Li,Yuanwei,Zhu,En,Chen,Hang,et al. Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,2022,33(6):1-1.
APA
Li,Yuanwei,Zhu,En,Chen,Hang,Tan,Jiyong,&Shen,Li.(2022).Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection.IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,33(6),1-1.
MLA
Li,Yuanwei,et al."Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection".IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 33.6(2022):1-1.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Li,Yuanwei]的文章
[Zhu,En]的文章
[Chen,Hang]的文章
百度学术
百度学术中相似的文章
[Li,Yuanwei]的文章
[Zhu,En]的文章
[Chen,Hang]的文章
必应学术
必应学术中相似的文章
[Li,Yuanwei]的文章
[Zhu,En]的文章
[Chen,Hang]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。