题名 | TT-LCD: Tensorized-Transformer based Loop Closure Detection for Robotic Visual SLAM on Edge |
作者 | |
DOI | |
发表日期 | 2023
|
ISBN | 979-8-3503-0018-5
|
会议录名称 | |
页码 | 166-172
|
会议日期 | 8-10 July 2023
|
会议地点 | Sanya, China
|
摘要 | Visual simultaneous localization and mapping (VSLAM) is one of the core technologies in autonomous driving, intelligent robots, metaverse and other fields. Besides, loop closure detection (LCD) is an essential component in VSLAM which can correct the drift and accumulated errors caused by the visual odometry (VO) front-end, and assist robot to build a globally consistent map. Over the years, several deep-learning methods have been proposed to address the task. However, the prior proposed neural network-based LCD models are heavy in model size, and difficult to be deployed on edge devices. In this paper, an LCD module based on the tensorized transformer model called TT-LCD is proposed. To obtain a tensorized transformer model with accuracy-complexity co-awareness which can be effectively deployed, we proposed a construction method for tensor compressed transformer model with tensor-train (TT) decomposition and a differential neural network architecture search (NAS) method for tensor rank selection. Experiments demonstrate that the TT-LCD realizes a model size 6.04 × smaller than uncompressed transformer model, 32.1 × smaller than the VGG model and achieves lower memory cost of about 134M on edge CPU with little loss of accuracy on pre-training dataset but even 2.13% higher average accuracy on NewCollege dataset compared with uncompressed DeiT-based model in LCD task. |
关键词 | |
学校署名 | 第一
|
相关链接 | [IEEE记录] |
收录类别 | |
EI入藏号 | 20233814760388
|
EI主题词 | Deep learning
; Learning systems
; Network architecture
; Tensors
|
EI分类号 | Ergonomics and Human Factors Engineering:461.4
; Robot Applications:731.6
; Algebra:921.1
|
来源库 | IEEE
|
全文链接 | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10218828 |
引用统计 |
被引频次[WOS]:0
|
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/559158 |
专题 | 工学院_深港微电子学院 |
作者单位 | School of Microelectronics, Southern University of Science and Technology, Shenzhen, China |
第一作者单位 | 深港微电子学院 |
第一作者的第一单位 | 深港微电子学院 |
推荐引用方式 GB/T 7714 |
Chenchen Ding,Hongwei Ren,Zhiru Guo,et al. TT-LCD: Tensorized-Transformer based Loop Closure Detection for Robotic Visual SLAM on Edge[C],2023:166-172.
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论