题名 | HIGH-PRECISION STEREO MATCHING ACCELERATOR BASED ON GRADIENT INFORMATION |
其他题名 | 基于梯度信息的高精度立体匹配加速器设计
|
姓名 | |
姓名拼音 | LI Ke
|
学号 | 12132452
|
学位类型 | 硕士
|
学位专业 | 0809 电子科学与技术
|
学科门类/专业学位类别 | 08 工学
|
导师 | |
导师单位 | 深港微电子学院
|
论文答辩日期 | 2024-05-16
|
论文提交日期 | 2024-06-16
|
学位授予单位 | 南方科技大学
|
学位授予地点 | 深圳
|
摘要 | Stereoscopic vision, mimicking the human binocular system to perceive depth, finds extensive applications in areas such as autonomous driving, 3D reconstruction, and facial recognition. Designing a dedicated hardware processor for stereoscopic vision poses the challenge of balancing high accuracy with high speed and low resource consumption. Addressing the matching challenges in stereoscopic vision, this paper proposes a novel semi-global stereo-matching hardware accelerator. This accelerator significantly enhances the accuracy of disparity information processing and optimizes hardware performance by executing four key steps collaboratively. The specific work and innovations are outlined as follows: Firstly, in the initial cost calculation stage, a pipeline-style initial cost calculation architecture based on gradient information is proposed to enhance the robustness of stereo matching. By adding a gradient calculation module, the requirement for a row buffer is drastically reduced, achieving over 60% resource savings. Linear approximation methods are employed to optimize the hardware implementation of exponential functions, significantly reducing resource usage while minimizing precision loss. Secondly, in the cost aggregation stage, this paper combines dual-path cost aggregation with guided filter aggregation. The aggregation path is optimized, and the critical path of aggregation calculation is split into multiple clock cycles, enhancing the system's operating frequency. Additionally, by streamlining the computation process of guided filtering and utilizing multiplication instead of division, dual optimization of computational efficiency and resource usage is achieved. Furthermore, the disparity calculation section introduces sub-pixel interpolation techniques based on SRT division, utilizing additional fixed-point decimals to refine disparities, thus improving the accuracy of the disparity map for distant objects. An average accuracy gain of 2% is achieved on the KITTI2015 dataset. Lastly, in the post-processing stage, an efficient synchronous hole-filling and median filtering hardware architecture is designed. Significant resource savings are achieved through cascading row buffer reuse. Additionally, two different effects can be selectively output for occluded regions to meet diverse application requirements. In terms of performance evaluation, this study achieves an average error rate of 5.26% on the KITTI2015 dataset, representing a reduction of 1.62% compared to other semi-global stereo-matching research. The designed architecture requires only 94,510 LUTs on the Stratix-V FPGA platform under conditions of 1920×1080 resolution and a 128-disparity range, significantly lower than other studies. Moreover, the system operates at a frequency of 125MHz, achieving a high throughput of 60 frames per second. |
其他摘要 | 双目立体视觉是一种模仿人类双眼以获取距离感知的技术,被广泛应用于自动驾驶汽车、三维重建、人脸识别等领域。设计一款专用的双目立体视觉硬件处理器面临着平衡高精度、高速度与低资源消耗之间的挑战。针对双目立体视觉的匹配挑战,本文提出了一种全新的半全局立体匹配硬件加速器,该加速器通过协同执行四个关键步骤显著提升了视差信息处理的精度并优化了硬件性能。具体工作和创新点如下: 首先,在初始代价计算阶段,本文提出了一种基于梯度信息的流水线式初始代价计算架构,以提高立体匹配的鲁棒性。通过增加梯度计算模块,大幅减少行缓存需求,实现了超过60%的资源节省。采用线性逼近方法以优化指数函数的硬件实现,在最小化精度损失的同时显著降低资源使用。 其次,在代价聚合环节,本文结合了双路径代价聚合和引导滤波聚合。优化了聚合路径并将聚合计算的关键路径拆分多个时钟周期计算,提升了系统的工作频率。此外,通过精简引导滤波的计算过程,利用乘法代替除法的思想,实现了计算效率与资源占用的双重优化。 进一步地,视差计算部分引入了了基于SRT除法的亚像素插值技术,利用额外的定点小数来细化视差,提高了对远处物体视差图的精确度。在KITTI2015数据集上达到了2%的平均精度增益。 最后,在后处理阶段,设计了一种高效的同步空洞填充和中值滤波硬件架构,通过级联行缓存的复用显著降低了资源使用。同时针对遮挡区域可选择性输出两种效果,以满足多变的应用需求。 在性能评估方面,本研究在KITTI2015数据集上实现了5.26%的错误率,相较于其他半全局立体匹配研究降低了1.62个百分点。所设计架构在Stratix-V FPGA平台、1920×1080分辨率、128视差范围等同条件下,LUTs仅需94,510个,显著低于其他研究。并且,系统可以工作在125MHz的频率下实现每秒60帧的高吞吐量。 |
关键词 | |
其他关键词 | |
语种 | 英语
|
培养类别 | 独立培养
|
入学年份 | 2021
|
学位授予年份 | 2024-06
|
参考文献列表 | [1] LI M, KWOH L K, YANG C J, et al. 3D building extraction with semi-global match-ing from stereo pair worldview-2 satellite imageries[C]. 2015 IEEE International Ge-oscience and Remote Sensing Symposium (IGARSS). IEEE, 2015: 3006-3009. |
所在学位评定分委会 | 电子科学与技术
|
国内图书分类号 | TN47
|
来源库 | 人工提交
|
成果类型 | 学位论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/765631 |
专题 | 南方科技大学 南方科技大学-香港科技大学深港微电子学院筹建办公室 |
推荐引用方式 GB/T 7714 |
Li K. HIGH-PRECISION STEREO MATCHING ACCELERATOR BASED ON GRADIENT INFORMATION[D]. 深圳. 南方科技大学,2024.
|
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | 操作 | |
12132452-李可-南方科技大学-香(3204KB) | -- | -- | 限制开放 | -- | 请求全文 |
个性服务 |
原文链接 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
导出为Excel格式 |
导出为Csv格式 |
Altmetrics Score |
谷歌学术 |
谷歌学术中相似的文章 |
[李可]的文章 |
百度学术 |
百度学术中相似的文章 |
[李可]的文章 |
必应学术 |
必应学术中相似的文章 |
[李可]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论