Journal of East China Normal University(Natural Sc ›› 2017, Vol. 2017 ›› Issue (5): 1-10.doi: 10.3969/j.issn.1000-5641.2017.05.001

• Data Management •     Next Articles

Parallel join based on distributed system OceanBase

XU Shi-lei, WANG Lei, HU Hui-qi, QIAN Wei-ning, ZHOU Ao-ying   

  1. School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China
  • Received:2017-06-19 Online:2017-09-25 Published:2017-09-25

Abstract: With the rapid growth of application data and the continued development of distributed database systems, data storage in physical independent nodes has become a trend. In this trend, when the application needs to perform complex join queries, it inevitably generates a lot of network traffic. Therefore, improving the efficiency of join query in distributed system is a hot topic. Based on the analysis of the nested loop join, Hash join, semi-join in the OceanBase, this paper puts forward the optimization idea of using hardware resources reasonably and using multithread to execute join operations in parallel. We implement experiment on OceanBase with nested loop join algorithm, Hash join algorithm, semi-join algorithm respectively. The experimental results confirm that the efficiency of join algorithm is positively related to parallelism in a certain number of threads.

Key words: query, semi-join, OceanBase, parallel join

CLC Number: