中文版 | English
题名

Unsupervised Part Discovery via Dual Representation Alignment

作者
发表日期
2024
DOI
发表期刊
ISSN
0162-8828
EISSN
1939-3539
卷号PP期号:99页码:1-18
摘要
Object parts serve as crucial intermediate representations in various downstream tasks, but part-level representation learning still has not received as much attention as other vision tasks. Previous research has established that Vision Transformer can learn instance-level attention without labels, extracting high-quality instance-level representations for boosting downstream tasks. In this paper, we achieve unsupervised part-specific attention learning using a novel paradigm and further employ the part representations to improve part discovery performance. Specifically, paired images are generated from the same image with different geometric transformations, and multiple part representations are extracted from these paired images using a novel module, named PartFormer. These part representations from the paired images are then exchanged to improve geometric transformation invariance. Subsequently, the part representations are aligned with the feature map extracted by a feature map encoder, achieving high similarity with the pixel representations of the corresponding part regions and low similarity in irrelevant regions. Finally, the geometric and semantic constraints are applied to the part representations through the intermediate results in alignment for part-specific attention learning, encouraging the PartFormer to focus locally and the part representations to explicitly include the information of the corresponding parts. Moreover, the aligned part representations can further serve as a series of reliable detectors in the testing phase, predicting pixel masks for part discovery. Extensive experiments are carried out on four widely used datasets, and our results demonstrate that the proposed method achieves competitive performance and robustness due to its part-specific attention. The code will be released upon paper acceptance.
IEEE
相关链接[IEEE记录]
收录类别
语种
英语
学校署名
其他
出版者
EI入藏号
20243516932324
EI主题词
Alignment ; Image representation ; Job analysis ; Mathematical transformations ; Semantic Segmentation ; Unsupervised learning
EI分类号
:1101.2 ; :1106.8 ; :1201.3 ; Mechanical Devices:601.1 ; Management:912.2
ESI学科分类
ENGINEERING
来源库
EV Compendex
引用统计
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/807144
专题工学院_计算机科学与工程系
南方科技大学
作者单位
1.Faculty of Engineering and IT, University of Technology Sydney, Ultimo, NSW, Australia
2.Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong, China
3.Department of Computer Science, The University of Hong Kong, Hong kong
推荐引用方式
GB/T 7714
Xia, Jiahao,Huang, Wenjian,Xu, Min,et al. Unsupervised Part Discovery via Dual Representation Alignment[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2024,PP(99):1-18.
APA
Xia, Jiahao.,Huang, Wenjian.,Xu, Min.,Zhang, Jianguo.,Zhang, Haimin.,...&Xu, Dong.(2024).Unsupervised Part Discovery via Dual Representation Alignment.IEEE Transactions on Pattern Analysis and Machine Intelligence,PP(99),1-18.
MLA
Xia, Jiahao,et al."Unsupervised Part Discovery via Dual Representation Alignment".IEEE Transactions on Pattern Analysis and Machine Intelligence PP.99(2024):1-18.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Xia, Jiahao]的文章
[Huang, Wenjian]的文章
[Xu, Min]的文章
百度学术
百度学术中相似的文章
[Xia, Jiahao]的文章
[Huang, Wenjian]的文章
[Xu, Min]的文章
必应学术
必应学术中相似的文章
[Xia, Jiahao]的文章
[Huang, Wenjian]的文章
[Xu, Min]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。