华东师范大学学报(自然科学版) ›› 2017, Vol. 2017 ›› Issue (5): 1-10.doi: 10.3969/j.issn.1000-5641.2017.05.001

• 数据管理 •    下一篇

基于分布式系统OceanBase的并行连接

徐石磊, 王雷, 胡卉芪, 钱卫宁, 周傲英   

  1. 华东师范大学 计算机科学与软件工程学院, 上海 200062
  • 收稿日期:2017-06-19 出版日期:2017-09-25 发布日期:2017-09-25
  • 通讯作者: 胡卉芪,男,助理研究员,研究方向为数据库.E-mail:hqhu@dase.ecnu.edu.cn E-mail:hqhu@dase.ecnu.edu.cn
  • 作者简介:徐石磊,男,硕士研究生,研究方向为数据存储与数据挖掘.E-mail:xsl118857@sina.com
  • 基金资助:
    2017年上海市青年科技英才扬帆计划(17YF1427800)

Parallel join based on distributed system OceanBase

XU Shi-lei, WANG Lei, HU Hui-qi, QIAN Wei-ning, ZHOU Ao-ying   

  1. School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China
  • Received:2017-06-19 Online:2017-09-25 Published:2017-09-25

摘要: 随着应用数据的飞速增长以及分布式数据库系统的不断涌现,数据存储在物理独立的节点已经成为一种趋势.在这种情况下,当应用需要进行复杂join查询时,就会不可避免地产生非常多的网络传输代价.所以,如何提高分布式系统中join查询的效率成为研究热点.本文在分析分布式数据库系统OceanBase执行nested loop join、Hashjoin、semi-join等算法的基础上,提出了合理利用硬件资源采用多线程并行执行join操作的优化思想,并在OceanBase数据库中分别对nested loop join、Hashjoin、semi-join等算法进行了并行改造.实验结果表明,在一定线程数内join算法执行效率与并行度呈正相关.

关键词: 查询, semi-join, OceanBase, 并行连接

Abstract: With the rapid growth of application data and the continued development of distributed database systems, data storage in physical independent nodes has become a trend. In this trend, when the application needs to perform complex join queries, it inevitably generates a lot of network traffic. Therefore, improving the efficiency of join query in distributed system is a hot topic. Based on the analysis of the nested loop join, Hash join, semi-join in the OceanBase, this paper puts forward the optimization idea of using hardware resources reasonably and using multithread to execute join operations in parallel. We implement experiment on OceanBase with nested loop join algorithm, Hash join algorithm, semi-join algorithm respectively. The experimental results confirm that the efficiency of join algorithm is positively related to parallelism in a certain number of threads.

Key words: query, semi-join, OceanBase, parallel join

中图分类号: