Title | Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval |
Authors | |
Corresponding author | You, Xinge |
Publication date | 2021
|
ISSN | 1045-0823
|
Proceedings title | |
Pages | 1106-1112
|
Abstract | Zero-shot sketch-based image retrieval (ZS-SBIR), which aims to retrieve photos with sketches under the zero-shot scenario, has shown extraordinary talents in real-world applications. Most existing methods leverage language models to generate class-prototypes and use them to arrange the locations of all categories in the common space for photos and sketches. Although great progress has been made, few of them consider whether such pre-defined prototypes are necessary for ZS-SBIR, where locations of unseen class samples in the embedding space are actually determined by visual appearance and a visual embedding actually performs better. To this end, we propose a novel Norm-guided Adaptive Visual Embedding (NAVE) model, for adaptively building the common space based on visual similarity instead of language-based pre-defined prototypes. To further enhance the representation quality of unseen classes for both photo and sketch modality, modality norm discrepancy and noisy label regularizer are jointly employed to measure and repair the modality bias of the learned common embedding. Experiments on two challenging datasets demonstrate the superiority of our NAVE over state-of-the-art competitors. |
University attribution | Other
|
Language | English
|
Related links | [Scopus record] |
Indexed by | |
Funding | National Natural Science Foundation of China[61772220];Science, Technology and Innovation Commission of Shenzhen Municipality[JCYJ20180305180637611];Science, Technology and Innovation Commission of Shenzhen Municipality[JCYJ20180305180804836];Science, Technology and Innovation Commission of Shenzhen Municipality[JSGG20180507182030600];
|
EI accession number | 20220911735453
|
EI subject terms | Artificial intelligence
; Embeddings
; Image retrieval
; Visual languages
|
EI classification codes | Computer Programming Languages:723.1.1
; Artificial Intelligence:723.4
|
Scopus record ID | 2-s2.0-85120554331
|
Source database | Scopus
|
Output type | Conference paper |
Item identifier | http://sustech.caswiz.com/handle/2SGJ60CL/328191 |
Collection | College of Engineering_Department of Computer Science and Engineering |
Author affiliations | 1. School of Electronic Information and Communication, Huazhong University of Science and Technology, 2. Shenzhen Research Institute, Huazhong University of Science and Technology, 3. Department of Computer Science and Engineering, Southern University of Science and Technology, |
Recommended citation (GB/T 7714) |
Wang,Wenjie,Shi,Yufeng,Chen,Shiming,et al. Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval[C],2021:1106-1112.
|
Files in this item | No files are associated with this item. |
|