计算机科学
图形数据库
数据库
图形
理论计算机科学
作者
Wei Zhang,Cheng Chen,Qiange Wang,W. Wang,Shijiao Yang,Bingyu Zhou,Huiming Zhu,Chao Chen,Yongjun Zhao,Yingqian Hu,Miaomiao Cheng,Meng Li,Hong-fei Tan,Mengjin Liu,H. F. Lin,Shuai Zhang,L. W. Zhang
标识
DOI:10.1145/3626246.3653373
摘要
ByteDance's products, including TikTok, Douyin, and Toutiao, generate massive amounts of graph data every day. Previously, we developed ByteGraph, a distributed graph database that manages the large-scale graph data with varying performance requirements. BG3 is deployed on the computation and storage decoupled architecture, which allows for high performance in-memory execution and independent scaling of computation and storage layers. Byte-Graph has demonstrated robust performance throughout its years of service in global-scale applications. However, as the business scale expands and applications evolve, the complexity and volume of graph analysis and processing have also increased. We observe that conventional database design faces issues with high operational costs when dealing with the large-scale graph workloads in social network management. To address this issue, we develop BG3 (ByteGraph 3.0), a cost-effective and high performance distributed graph database which provides three critical components. Firstly, a cost-effective yet query-efficient graph storage engine based on the BW-tree-based memory indices and affordable cloud storage. Secondly, a workload aware space reclamation mechanism, which enhances storage utilization and reduces write amplifications. Thirdly, a lightweight leader-follower synchronization mechanism ensuring strong consistency for scaling out real-time graph analysis. Experimental results demonstrate that BG3 addresses the limitations of ByteGraph, offering a cost-effective, efficient, and scalable solution for processing ByteDance's large-scale graphs.
科研通智能强力驱动
Strongly Powered by AbleSci AI