Journal of East China Normal University (Natural Science) ›› 2019, Vol. 2019 ›› Issue (5): 190-202. doi: 10.3969/j.issn.1000-5641.2019.05.016

• Data Management Technology in the New Era •

Woodpecker+: Customized workload performance evaluation based on data characteristics

ZHANG Tao, ZHANG Xiao-lei, LI Yu-ming, ZHANG Chun-xi, ZHANG Rong

  1. School of Data Science and Engineering, East China Normal University, Shanghai 200062, China
  • Received: 2019-07-28 Online: 2019-09-25 Published: 2019-10-11
  • Corresponding author: ZHANG Rong, female, professor; research interest: distributed data management. E-mail: rzhang@dase.ecnu.edu.cn
  • About the author: ZHANG Tao, male, master's student; research interest: database benchmarking. E-mail: zhangtao1906@163.com
  • Supported by:
    National Major Project (2018YFB1003404); National Natural Science Foundation of China (61432006)

Abstract: As applications grow more complex and diverse, database performance evaluation becomes increasingly important, yet researchers and developers are often constrained by a lack of suitable workloads. General-purpose testing tools such as Sysbench and OLTP-Bench improve testing efficiency to some extent, but standard benchmark workloads are fixed and poorly represent users' diverse application scenarios, so they cannot accurately measure system performance for a specific application. Moreover, most testing frameworks require a test workload to be written in a high-level programming language for each application, which adds to the testing burden and introduces a substantial amount of repetitive work, resulting in inefficient testing. To address these issues, this paper designs and implements Woodpecker+, a tool for user-defined performance test workloads. Its main contributions are as follows: it is easy to use and extensible; it provides a test definition language (TDL) for efficiently constructing test cases and scheduling tests; it offers flexible control over transaction execution modes and data access distributions; it implements lightweight, fine-grained collection and analysis of statistics; and it supports multiple mainstream DBMSs as well as other databases that provide a database access interface. A detailed set of customized workload experiments running on mainstream DBMSs verifies these features.

Key words: database, performance test, customized workload
