物流时空数据分析与智能优化理论

基于查询代价的两级轨迹数据划分算法

  • 刘梦男 ,
  • 许建秋
展开
  • 南京航空航天大学 计算机科学与技术学院, 南京 211106

收稿日期: 2022-06-22

  网络出版日期: 2022-09-26

基金资助

国家自然科学基金 (61972198); 江苏省自然科学基金 (BK20191273)  

Two-level trajectory data partition algorithm based on query cost

  • Mengnan LIU ,
  • Jianqiu XU
Expand
  • College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

Received date: 2022-06-22

  Online published: 2022-09-26

摘要

轨迹数据具有规模大、更新频繁的特点, 对轨迹数据的查询具有较高的性能要求. 为了提高轨迹数据的查询效率, 提出了两级轨迹数据划分算法: 在第一级划分中, 使用基于优化最小边界矩形(Minimum Bounding Rectangle, MBR)的轨迹数据划分方法将轨迹数据划分为子轨迹, 以提高轨迹数据的近似效果; 在第二级划分中, 按照时空范围, 使用网格结构对子轨迹进行分组. 基于划分算法提出了R-tree结点组织方法, 将划分后的轨迹数据自底向上地构建R-tree. 通过实验展示了所提的划分算法对查询效率的提升. 实验表明, 与基于轨迹段平均个数和基于组合运动特征这两种轨迹数据划分算法相比, 所提算法具有更好的查询性能, 查询效率分别平均提升了43.0%和30.5%.

本文引用格式

刘梦男 , 许建秋 . 基于查询代价的两级轨迹数据划分算法[J]. 华东师范大学学报(自然科学版), 2022 , 2022(5) : 184 -194 . DOI: 10.3969/j.issn.1000-5641.2022.05.015

Abstract

Trajectory data are typically large in scale and require frequent updates; hence, there are high performance requirements for trajectory queries. In order to improve the query efficiency of trajectory data, a two-level trajectory partition algorithm is presented herein. In the first partition, trajectory data was divided into sub-trajectories based on optimized minimum bounding rectangle (MBR) to improve the approximate effect of trajectory data. In the second partition, the sub-trajectories were grouped by the grid structure according to spatio-temporal characteristics. A packing method of R-tree was proposed based on the partition algorithm, and the divided trajectory data was packed into the R-tree from bottom to top. Finally, compared with a method based on the average number or average size of trajectory segments, experimental results show that the proposed method offers better query performance than the two other methods based on the average number of trajectory segments and combined movement features; in fact, the query efficiency is improved by 43% and 30.5%, respectively.

参考文献

1 张林, 汤大权, 张翀. 时空索引的演变与发展. 计算机科学, 2010, 37 (4): 15- 20,26.
2 赵馨逸, 黄向东, 乔嘉林, 等. 基于不均匀空间划分和 R 树的时空索引. 计算机研究与发展, 2019, 56 (3): 666- 676.
3 CHAKKA P V, EVERSPAUGH A C, PATEL J M. Indexing large trajectory data sets with SETI [C/OL]// First Biennial Conference on Innovative Data Systems Research, CIDR 2003. (2003-01-05)[2022-06-09]. https://www.cidrdb.org/.
4 GUTTMAN A. R-trees: A dynamic index structure for spatial searching [G]// Readings in Database Systems. San Francisco: Morgan Kaufmann Publishers Inc., 1994: 125-135.
5 YUE Z W, ZHANG J W, ZHANG H B, et al. Time-based trajectory data partitioning for efficient range query [C]// International Conference on Database Systems for Advanced Applications. Cham: Springer, 2018: 24-35.
6 TANG J, LIU L F, WU J G. A trajectory partition method based on combined movement features [J]. Wireless Communications and Mobile Computing, 2019, 2019: 7803293. DOI: 10.1155/2019/7803293.
7 MOUASVI S M, HARWOOD A, KARUNASEKERA S, et al. Geometry of interest (GOI): Spatio-temporal destination extraction and partitioning in GPS trajectory data. Journal of Ambient Intelligence and Humanized Computing, 2017, 8 (3): 419- 434.
8 LEE J G, HAN J, WHANG K Y. Trajectory clustering: a partition-and-group framework [C]// Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. ACM, 2007: 593-604.
9 MASCIARI E. Trajectory clustering via effective partitioning [C]// International Conference on Flexible Query Answering Systems. Berlin: Springer, 2009: 358-370.
10 HADJIELEFTHERIOU M, KOLLIOS G, TSOTRAS V J, et al. Efficient indexing of spatiotemporal objects [C]// International Conference on Extending Database Technology. Berlin: Springer, 2002: 251-268.
11 RASETIC S, SANDER J, ELDING J, et al. A trajectory splitting model for efficient spatio-temporal indexing [C]// VLDB’05: Proceedings of the 31st International Conference on Very Large Data Bases. VLDB Endowment, 2005: 934-945.
12 THEODORIDIS Y, SELLIS T. A model for the prediction of R-tree performance [C]// Proceedings of the 15th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. ACM, 1996: 161-171.
13 李松, 崔环宇, 张丽平, 等. 基于CURE聚类算法的静态R树构建方法[J]. 计算机科学, 2015, 42(10): 193-197.
14 LEUTENEGGER S T , LOPEZ M A , EDGINGTON J. STR: A simple and efficient algorithm for R-tree packing [C]// Proceedings 13th International Conference on Data Engineering. IEEE, 1997: 497-506. DOI: 10.1109/ICDE.1997.582015.
15 BECKMANN N, KRIEGEL H P, SCHNEIDER R, et al. The R*-tree: An efficient and robust access method for points and rectangles [C]// SIGMOD Conference. ACM, 1990: 322-331.
16 GüTING R H, BEHR T, DüNTGEN C. SECONDO: A platform for moving objects database research and for publishing and integrating research implementations [G]// Bulletin of the IEEE Computer Society Technical Committee on Data Engineering. IEEE, 2010: 56-63.
文章导航

/