题名 | Effective and Efficient Summarization for Non-hierarchical Data |
作者 | |
DOI | |
发表日期 | 2021
|
ISSN | 2767-9519
|
ISBN | 978-1-6654-2331-1
|
会议录名称 | |
页码 | 100-106
|
会议日期 | 2-3 Dec. 2021
|
会议地点 | Moscow, Russian Federation
|
摘要 | Data cubes are ubiquitous in domains including meteorology, sales and demography, and data summarization is an important service that can be used to compress data cubes and extract insight. Existing data summarization methods require presupposed hierarchies for the data cube dimensions, which do not exist for many types of data (e.g., rainfall and temperature). To tackle this problem, we first define the non-hierarchical data summarization (NHDS) problem, which covers data cube using rectangle regions with an error bound and minimizes the summary size. We then show that the NHDS problem is NP-hard and design the Mark and Select (MS) algorithm to find an approximate solution. MS first identifies the qualified rectangles and then selects among the rectangles to cover the data cube. To improve efficiency, we show that it suffices to find only some of the qualified rectangles, devise a procedure to avoid checking rectangles that do not contribute to the result, save unnecessary computation during rectangle selection using sub-modularity. We conducted experiments on both real and synthetic datasets. The results show that MS significantly outperforms a state-of-the-art baseline in summary size, error and running time. |
关键词 | |
学校署名 | 其他
|
相关链接 | [IEEE记录] |
收录类别 | |
来源库 | IEEE
|
全文链接 | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9708661 |
引用统计 |
被引频次[WOS]:0
|
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/347990 |
专题 | 南方科技大学 |
作者单位 | 1.National University of Defense Technology,Changsha,China 2.Southern University of Science and Technology,Shenzhen,China |
推荐引用方式 GB/T 7714 |
Xiang Ji,Xiao Yan,Kaijun Ren,et al. Effective and Efficient Summarization for Non-hierarchical Data[C],2021:100-106.
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论