Sublink elimination and optimization in data storage and processing separation architecture

  • WANG Yan-zhao ,
  • HU Hui-qi ,
  • ZHANG Zhao ,
  • LIU Xiao-bing ,
  • DUAN Hui-chao
Expand
  • School of Data Science and Engineering, East China Normal University, Shanghai 200062, China

Received date: 2017-06-26

  Online published: 2018-07-19

Abstract

This paper implemented a sublink elimination optimizer in NewSQL RDBMS (Relational Database Management System). The purpose of the optimizer is to make complex SQL so that it can be executable in New SQL systems, and achieve auto-tuning of complex SQL to reduce the cost of migrating from a centralized database to a distributed database; this would make the NewSQL DBMS (Database Management System) available to telecomunications and banking industries. We also provide a method to minimize data transmission between servers after sublink promotion. Experiments show that the optimization effect of the query optimizer is notable and can replace manual optimization and reduce the workload of application migration.

Cite this article

WANG Yan-zhao , HU Hui-qi , ZHANG Zhao , LIU Xiao-bing , DUAN Hui-chao . Sublink elimination and optimization in data storage and processing separation architecture[J]. Journal of East China Normal University(Natural Science), 2018 , 2018(4) : 90 -98 . DOI: 10.3969/j.issn.1000-5641.2018.04.009

References

[1] CHANG F, DEAN J, GHEMAWAT S, et al. Bigtable:A distributed storage system for structured data[C]//Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation. Berkeley:USENIX Associaition, 2006, 205-218.
[2] CHODOROW K. MongoDB:The Definitive Guide[M].[S.l.]:O'Reilly Media Inc, 2013.
[3] CORBETT J C, DEAN J, EPSTEIN M, et al. Spanner:Google's globally distributed database[C]//Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation. Berkeley:USENIX Associaition, 2012, 251-264.
[4] ALIBABA. OceanBase[EB/OL].[2017-05-02]. https://github.com/alibaba/oceanbase.
[5] daseECNU. Cedar[EB/OL].[2017-05-02]. https://github.com/daseECNU/Cedar.
[6] STONEBRAKER M, HANSON E, HONG C H. The design of POSTGRES rules system[C]//1987 IEEE 3rd International Conference on Data Engineering. IEEE, 1987:365-374.
[7] 李海翔. 数据库查询优化器的艺术:原理解析与SQL性能优化[M]. 北京:机械工业出版社, 2014:2-263.
[8] SESHADRI P, PIRAHESH H, LEUNG T Y C. Complex query decorrelation[C]//Proceedings of the 12th International Conference on Data Engineering. IEEE, 1996:450-458.
[9] RAO J, ROSS K A. Reusing invariants:A new strategy for correlated queries[J]. ACM SIGMOD Record, 1998, 27(2):37-48.
[10] BURTON H. Bloom, space/time trade-offs in hash coding with allowable errors[J]. Communications of the ACM, 1970, 13(7):422-426.
Outlines

/