中文版 | English
题名

Manu: A Cloud Native Vector Database Management System

作者
发表日期
2022
DOI
发表期刊
EISSN
2150-8097
卷号15期号:12页码:3548-3561
摘要
With the development of learning-based embedding models, embedding vectors are widely used for analyzing and searching unstructured data. As vector collections exceed billion-scale, fully managed and horizontally scalable vector databases are necessary. In the past three years, through interaction with our 1200+ industry users, we have sketched a vision for the features that next-generation vector databases should have, which include long-term evolvability, tunable consistency, good elasticity, and high performance. We present Manu, a cloud native vector database that implements these features. It is difficult to integrate all these features if we follow traditional DBMS design rules. As most vector data applications do not require complex data models and strong data consistency, our design philosophy is to relax the data model and consistency constraints in exchange for the aforementioned features. Specifically, Manu firstly exposes the write-ahead log (WAL) and binlog as backbone services. Secondly, write components are designed as log publishers while all read-only analytic and search components are designed as independent subscribers to the log services. Finally, we utilize multi-version concurrency control (MVCC) and a delta consistency model to simplify the communication and cooperation among the system components. These designs achieve a low coupling among the system components, which is essential for elasticity and evolution. We also extensively optimize Manu for performance and usability with hardware-aware implementations and support for complex search semantics. Manu has been used for many applications, including, but not limited to, recommendation, multimedia, language, medicine and security. We evaluated Manu in three typical application scenarios to demonstrate its efficiency, elasticity, and scalability.
相关链接[Scopus记录]
收录类别
语种
英语
学校署名
其他
EI入藏号
20223812759646
EI主题词
Concurrency control ; Database systems ; Elasticity ; Embeddings ; Semantics
EI分类号
Database Systems:723.3 ; Artificial Intelligence:723.4 ; Algebra:921.1
Scopus记录号
2-s2.0-85138001520
来源库
Scopus
引用统计
被引频次[WOS]:7
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/402784
专题南方科技大学
作者单位
1.Zilliz,China
2.Southern University of Science and Technology,China
3.Technical University of Munich,Germany
推荐引用方式
GB/T 7714
Guo,Rentong,Luan,Xiaofan,Xiang,Long,et al. Manu: A Cloud Native Vector Database Management System[J]. Proceedings of the VLDB Endowment,2022,15(12):3548-3561.
APA
Guo,Rentong.,Luan,Xiaofan.,Xiang,Long.,Yan,Xiao.,Yi,Xiaomeng.,...&Xie,Charles.(2022).Manu: A Cloud Native Vector Database Management System.Proceedings of the VLDB Endowment,15(12),3548-3561.
MLA
Guo,Rentong,et al."Manu: A Cloud Native Vector Database Management System".Proceedings of the VLDB Endowment 15.12(2022):3548-3561.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Guo,Rentong]的文章
[Luan,Xiaofan]的文章
[Xiang,Long]的文章
百度学术
百度学术中相似的文章
[Guo,Rentong]的文章
[Luan,Xiaofan]的文章
[Xiang,Long]的文章
必应学术
必应学术中相似的文章
[Guo,Rentong]的文章
[Luan,Xiaofan]的文章
[Xiang,Long]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。