题名 | GHive: accelerating analytical query processing in apache hive via CPU-GPU heterogeneous computing |
作者 | |
通讯作者 | Bo Tang |
DOI | |
发表日期 | 2022-11-07
|
会议名称 | Proceedings of the 13th Symposium on Cloud Computing
|
ISSN | 9781450394147
|
会议日期 | November 7 - 11, 2022
|
会议地点 | San Francisco
|
摘要 | As a popular distributed data warehouse system, Apache Hive has been widely used for big data analytics in many organizations. Meanwhile, exploiting the massive parallelism of GPU to accelerate online analytical processing (OLAP) has been extensively explored in the database community. In this paper, we present GHive, which enhances CPU-based Hive via CPU-GPU heterogeneous computing. GHive is designed for the business intelligence applications and provides the same API as Hive for compatibility. To run SQL queries jointly on both CPU and GPU, GHive comes with three key techniques: (i) a novel data model gTable, which is column-based and enables efficient data movement between CPU memory and GPU memory; (ii) a GPU-based operator library Panda, which provides a complete set of SQL operators with extensively optimized GPU implementations; (iii) a hardware-aware MapReduce job placement scheme, which puts jobs judiciously on either GPU or CPU via a cost-based approach. In the experiments, we observe that GHive outperforms Hive in both query processing speed and operating expense on the Star Schema Benchmark (SSB). |
学校署名 | 第一
; 通讯
|
来源库 | 人工提交
|
引用统计 |
被引频次[WOS]:4
|
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/415608 |
专题 | 工学院_计算机科学与工程系 |
作者单位 | 1.Research Inst. of Trustworthy Autonomous Systems, Southern University of Science and Technology,Department of Computer Science and Engineering, Southern University of Science and Technology 2.Aalborg University 3.The Hong Kong Polytechnic University 4.Boston University 5.Huawei Technologies Co., Ltd |
第一作者单位 | 计算机科学与工程系 |
通讯作者单位 | 计算机科学与工程系 |
第一作者的第一单位 | 计算机科学与工程系 |
推荐引用方式 GB/T 7714 |
Haotian Liu,Bo Tang,Jiashu Zhang,et al. GHive: accelerating analytical query processing in apache hive via CPU-GPU heterogeneous computing[C],2022.
|
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | 操作 | |
ghive.pdf(3081KB) | -- | -- | 限制开放 | -- |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论