华东师范大学学报(自然科学版) ›› 2014, Vol. 2014 ›› Issue (5): 164-172.doi: 10.3969/j.issn.10005641.2014.05.014

• 计算机科学与技术 • 上一篇    下一篇

OceanBase分布式存储引擎

黄贵,庄明强   

  1. 阿里巴巴集团公司,杭州 310000
  • 出版日期:2014-09-25 发布日期:2014-11-27
  • 作者简介:第一作者:黄贵,男,阿里巴巴集团技术专家,研究方向为分布式数据库.Email:qushan@alipay.com.

Scalable distributed storage of OceanBase

 HUANG  Gui, ZHUANG  Ming-Qiang   

  1. Alibaba Group, Hangzhou 310000, China
  • Online:2014-09-25 Published:2014-11-27

摘要: OceanBase是一个分布式关系型数据库,其目的是存储海量的高速增长的结构化数据,以廉价的服务器集群实现高可用、高可扩展、高性价比的服务.OceanBase采用内外存混合存储的模式,使用内存存储增量(新写入)数据,而使用外存存储基线(只读)数据,并将基线数据划分成大致等量的数据分片并采用分布式B+ tree的形式将分片存放在很多的数据服务器上,利用定时合并机制不断将增量数据与基线数据融合.本文介绍OceanBase基线数据存储的基本结构和分布方式、定时合并机制,以及基线数据在OceanBase中的具体存储格式的设计和实现.

关键词: 存储引擎, 分布式系统, 每日合并, 分块存储, 基线数据, 增量数据

Abstract: OceanBase is a distributed relational database, its purpose is to store vast amounts of structured data in highgrowth, lowcost servers to achieve high availability, high scalability and costeffective services. OceanBase using memory and external store hybrid storage mode, stores the incremental (update) data in memory, and the baseline (readonly) data in external storage (usually disk), baseline data is divided into slices we called tablet roughly the same amount of data and the use of distributed B+ tree stored on many data servers, using the daily merge mechanism to keep the combined incremental data into baseline.This article describes the basic structure and distribution methods of OceanBase baseline data storage, as well as the daily merge mechanism, in addition, we will introduce in OceanBase baseline data storage format of the specific design and implementation.

Key words: storage engine, distributed system, daily merge, block, stable store, base data, increment data

中图分类号: