基于GPU的关系型流处理系统实现与优化

黄皓; 李志方; 王嘉伦; 翁楚良

doi:10.3969/j.issn.1000-5641.2019.05.015

华东师范大学学报（自然科学版） >

2019 , Vol. 2019 >Issue 5: 178 - 189

DOI: https://doi.org/10.3969/j.issn.1000-5641.2019.05.015

新时期数据管理技术

基于GPU的关系型流处理系统实现与优化

黄皓 ,
李志方 ,
王嘉伦 ,
翁楚良

展开

华东师范大学数据科学与工程学院, 上海 200062

黄皓,男,硕士研究生,研究方向为内存型数据库.E-mail:haohuang@stu.ecnu.edu.cn.

收稿日期: 2019-07-29

网络出版日期: 2019-10-11

基金资助

国家重点研发计划（2018YFB1003400）

收起

Implementation and optimization of GPU-based relational streaming processing systems

HUANG Hao ,
LI Zhi-fang ,
WANG Jia-lun ,
WENG Chu-liang

Expand

School of Data Science and Engineering, East China Normal University, Shanghai 200062, China

Received date: 2019-07-29

Online published: 2019-10-11

Fold

摘要

现有的基于CPU的流处理系统在功能上已支持在大规模数据集上的复杂分析查询，但由于CPU计算能力与特性的限制，无法在性能上同时满足高吞吐量和低响应时间的要求.本文提出一种基于GPU的流处理系统框架Serval，通过充分利用CPU-GPU异构资源，实现了关系型流查询的高效处理.Serval框架采用流水线模型和流执行缓存技术以优化吞吐量和响应时间，并实现多种调优策略以适应不同场景.实验表明，单节点Serval的吞吐量与响应时间性能均优于现有GPU数据库MapD和三节点分布式服务器上的Spark Streaming.

关键词： 流处理系统; 关系型查询; GPU~数据库

本文引用格式

黄皓 , 李志方 , 王嘉伦 , 翁楚良 . 基于GPU的关系型流处理系统实现与优化[J]. 华东师范大学学报（自然科学版）, 2019 , 2019(5) : 178 -189 . DOI: 10.3969/j.issn.1000-5641.2019.05.015

Abstract

State-of-the-art CPU-based streaming processing systems support complex queries on large-scale datasets. However, limited by CPU computational capability, these systems suffer from the performance tradeoff between throughput and response time, and cannot achieve the best of both. In this paper, we propose a GPU-based streaming processing system, named Serval, that co-utilizes CPU and GPU resources and efficiently processes streaming queries by micro-batching. Serval adopts the pipeline model and uses streaming execution cache to optimize throughput and response time on large scale datasets. To meet the demands of various scenarios, Serval implements multiple tuning policies by scaling the micro-batch size dynamically. Experiments show that a single-server Serval outperforms a 3-server distributed Spark Streaming by 3.87x throughput with a 91% response time on average, reflecting the efficiency of the optimization.

Key words： streaming processing system; relational query; GPU database

参考文献

[1] ZAHARIA M, DAS T, LI H Y, et al. Discretized streams:An efficient and fault-tolerant model for stream processing on large clusters[C]//Proceedings of the 4th Workshop on Hot Topics in Cloud Computing. USENIX Association, 2012.
[2] IQBAL M H, SOOMRO T R. Big data analysis:Apache storm perspective[J]. International Journal of Computer Trends and Technology, 2015, 19(1):9-14.
[3] BRESS S, KÖCHER B, HEIMEL M, et al. Ocelot/HyPE:Optimized data processing on heterogeneous hardware[J]. Proceedings of the VLDB Endowment, 2014, 7(13):1609-1612.
[4] CARBONE P, KATSIFODIMOS A, EWEN S, et al. Apache Flink:Stream and batch processing in a single engine[J]. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 2015, 36(4):28-38.
[5] ZHANG S, HE J, HE B, et al. Omnidb:Towards portable and efficient query processing on parallel CPU/GPU architectures[J]. Proceedings of the VLDB Endowment, 2013, 6(12):1374-1377.
[6] CHEN C, LI K, OUYANG A, et al. GFlink:An in-memory computing architecture on heterogeneous CPU-GPU clusters for big data[J]. IEEE Transactions on Parallel and Distributed Systems, 2018, 29(6):1275-1288.
[7] Nvidia Cooperation. CUDA C Programming Guide[R/OL].(2018-04-01)[2019-05-02]. https://docs.nvidia.com/cuda/archive/9.1/pdf/CUDACProgrammingGuide.pdf.
[8] BRESS S, HEIMEL M, SIEGMUND N, et al. GPU-accelerated database systems:Survey and open challenges[M]//Transactions on Large-Scale Data and Knowledge-Centered Systems XV. Berlin:Springer, 2014:1-35.
[9] MOSTAK T. An overview of MapD (massively parallel database)[R]. White paper. Massachusetts Institute of Technology, 2013.
[10] ROOT C, MOSTAK T. MapD:A GPU-powered big data analytics and visualization platform[C]//ACM SIGGRAPH 2016 Talks. ACM, 2016:73.
[11] Kinetica DB Inc. Kinetica high performance analytics database[EB/OL].[2019-05-11]. https://www.kinetica.com.
[12] SQream Technologies. SQream:Big Data SQL database[EB/OL].[2019-05-02]. https://sqream.com/.
[13] CHEN Z, XU J, TANG J, et al. GPU-accelerated high-throughput online stream data processing[J]. IEEE Transactions on Big Data, 2016, 4(2):191-202.
[14] CHEN C, LI K, OUYANG A, et al. GFlink:An in-memory computing architecture on heterogeneous CPU-GPU clusters for big data[J]. IEEE Transactions on Parallel and Distributed Systems, 2018, 29(6):1275-1288.
[15] ZHANG Y, MUELLER F. GStream:A general-purpose data streaming framework on GPU clusters[C]//2011 International Conference on Parallel Processing. IEEE, 2011:245-254.
[16] KIM J, SEO S, LEE J, et al. SnuCL:An OpenCL framework for heterogeneous CPU/GPU clusters[C]//Proceedings of the 26th ACM International Conference on Supercomputing. ACM, 2012:341-352.
[17] HEWANADUNGODAGE C, XIA Y, LEE J J. GStreamMiner:A GPU-accelerated data stream mining framework[C]//Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 2016:2489-2492.
[18] HUYNH H P, HAGIESCU A, WONG W F, et al. Scalable framework for mapping streaming applications onto multi-GPU systems[C]//ACM Sigplan Notices. ACM, 2012, 47(8):1-10.

Options

文章导航

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献