Journal of East China Normal University(Natural Sc ›› 2018, Vol. 2018 ›› Issue (5): 67-78.doi: 10.3969/j.issn.1000-5641.2018.05.006

Previous Articles     Next Articles

The designs and implementations of columnar storage in Cedar

YU Wen-qian, HU Shuang, HU Hui-qi   

  1. School of Data Science and Engineering, East China Normal University, Shanghai 200062, China
  • Received:2018-07-09 Online:2018-09-25 Published:2018-09-26

Abstract: With the growing size of data and analytical needs, the query performance of databases for OLAP (On-Line Analytical Processing) applications has become increasingly important. Cedar is a distributed relational database based on read-write decoupled architecture. Since Cedar is mainly oriented to the needs of OLTP (On-Line Transaction Processing) applications, it has insufficient performance for handling analytical processing workloads. To address this issue, many studies have shown that column storage technology can effectively improve the efficiency of I/O (Input/Output) and enhance the performance of analytical processing. This paper presents a column-based storage mechanism in Cedar. The study analyzes applicable scenarios and improves Cedar's data query and batch update methods for this mechanism. The results of an experiment demonstrate that the proposed mechanism can enhance the performance of analytical processing substantially, while limiting the negative impacts on transaction processing performance to within 10%.

Key words: distributed database, column-based storage, OLAP

CLC Number: