Journal of East China Normal University(Natural Science) ›› 2024, Vol. 2024 ›› Issue (5): 141-151.doi: 10.3969/j.issn.1000-5641.2024.05.013

• Educational Data Management • Previous Articles     Next Articles

Online analytical processing query cardinality estimation capability evaluation

Wei JIAN, Zirui HU, Rong ZHANG*()   

  1. School of Data Science and Engineering, East China Normal University, Shanghai 200062, China
  • Received:2024-07-04 Accepted:2024-07-28 Online:2024-09-25 Published:2024-09-23
  • Contact: Rong ZHANG E-mail:rzhang@dase.ecnu.edu.cn

Abstract:

Query optimization can significantly enhance the analysis efficiency of online analytical processing (OLAP) database systems for massive educational data, providing fast and accurate data support for intelligent educational systems. The optimizer mainly consists of three modules: cardinality estimation, space enumeration, and cost models. Specifically, cardinality estimation determines the results of the cost model and guides the selection of query plans. Therefore, the evaluation of the cardinality estimation module of the optimizer plays a crucial role in the optimization of OLAP database systems. This study designs and implements an effective workload generation tool based on primary key-driven diversified data distribution and data relationship construction. The tool includes data generation technology with custom relationships, workload template generation technology based on finite state machines, and parameter instantiation technology driven by target cardinality. Experiments were conducted on three databases: OceanBase, TiDB, and PostgreSQL, analyzing the issues of their optimizers and providing suggestions.

Key words: OLAP database, query optimization, cardinality estimation

CLC Number: