基于自注意力机制的钢铁物流运力预测

doi:10.3969/j.issn.1000-5641.2022.05.014

摘要/Abstract

摘要：

运力预测在大宗物流中发挥着关键作用, 对提高运力调度与车货匹配的精准性具有重要意义. 网约车运力预测目标为预测未来时段内可用车辆的数量; 而大宗物流的运力预测任务旨在预估未来时段内不同货运流向的空闲车辆信息 (如车辆ID(Identity Document)), 这与货车是否能在预计时间内返回钢厂 (称为运力可达性) 紧密相关. 以钢铁物流为例, 需要考虑由钢厂运输货物至客户企业以及从客户企业返回钢厂这两段行程耗时的影响. 由于长途运输过程中货车需要多次停留但停留时长不等, 停留时间的不确定使准确预测运输送达时间面临挑战; 此外, 网络货运平台仅对钢厂的货运任务进行运力指派, 货车返程货源则由司机自行联系确定, 导致返程轨迹缺失, 为预测货车返回钢厂的时间带来挑战. 为解决上述挑战, 基于物流企业的运单、车辆、轨迹以及运输终点等数据集, 提取货车的停留行为特征、运输终点特征、环境特征等, 并引入自注意力机制分别获取不同特征对两段行程耗时影响的权重, 进一步提升运力可达性预测的精度. 在此基础上, 提出了基于自注意力机制的运力预测方法, 包括基于历史流向相似性的运力候选集生成、基于自注意力机制的运力可达性预测、基于长短期记忆网络 (Long Short-Term Memory, LSTM) 模型的运力承运流向预测等3个部分. 最后, 在真实数据集上进行了大量对比实验. 实验结果表明, 所提方法具有更高的预测精度, 能为大宗物流的运力调度优化等任务提供强有力的决策支持.

关键词: 运力预测, 流向相似性, 可达性预测

Abstract:

Capacity prediction plays an important role in smart logistics, and its results are important for improving the accuracy of capacity scheduling and truck-cargo matching. Existing researches on capacity prediction in urban road networks aim to determine the number of available vehicles in future periods, while the problem of capacity prediction in bulk logistics aims at predicting the information on the trucks (e.g. the truck’s identity document (ID)) to carry certain types of goods for different flows, which is closely related to whether the trucks can return to the steel plant within the expected time (called capacity accessibility). In the case of bulk logistics, it is necessary to take into account the impact of the time spent on the two trips from the steel plant to the customer’s business and back to the steel plant. Since trucks need to stop several times in the long-distance transportation process but the length of stopping time varies, the uncertainty of stopping time makes the accurate prediction of transportation delivery time difficult. In addition, the freight platform only assigns capacity to one-way transport tasks (i.e. from the steel plant to the customer’s business), and the return trip (i.e. back to the steel plant) is determined by the truck drivers, which leads to the lack of return trajectory and poses a challenge to predict the return time of trucks to the steel plant. In order to solve the above challenges, based on the data sets of waybills, trucks, trajectories, and transport endpoints of logistics enterprises, we extract the stay behavior features, transport endpoint features, and environmental features. Then, the self-attention mechanism is introduced to obtain the weights of different features on the time consumption of two trips respectively to further improve the accuracy of capacity accessibility prediction. On this basis, a truck capacity prediction method based on self-attention mechanism is proposed, including capacity candidate set generation based on historical flow similarity, capacity accessibility prediction based on self-attention mechanism, and capacity carrier flow prediction based on long short-term memory (LSTM). Finally, the experimental results of comparison experiments on real logistics datasets show that the proposed method has higher prediction accuracy and can provide powerful decision support for the optimization of capacity scheduling in bulk logistics.

Key words: capacity prediction, flow similarity, accessibility prediction

中图分类号:

TP311

苗晓变, 廖家俊, 梅华杰, 冯冲, 毛嘉莉. 基于自注意力机制的钢铁物流运力预测[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 165-183.

Xiaobian MIAO, Jiajun LIAO, Huajie MEI, Chong FENG, Jiali MAO. Truck capacity prediction based on self-attention mechanism in the bulk logistic industry[J]. Journal of East China Normal University(Natural Science), 2022, 2022(5): 165-183.

图/表 14

图1

图2

图3

图4

图5

表1

特征描述"

特征类型	特征名称	特征符号	特征描述
运力特征	车辆ID	$ {c}_{\mathrm{i}\mathrm{d}} $	承运车辆
	历史平均停留次数	$ {c}_{\mathrm{s}\mathrm{t}\mathrm{a}\mathrm{y}\mathrm{T}\mathrm{i}\mathrm{m}\mathrm{e}} $	承运车辆历史运输中平均停留的次数
	历史平均停留时间	$ {c}_{\mathrm{s}\mathrm{t}\mathrm{a}\mathrm{y}\mathrm{D}\mathrm{u}\mathrm{r}\mathrm{a}} $	承运车辆历史运输中平均停留的时长
流向客户特征	客户地平均路径距离	$ {e}_{\mathrm{d}\mathrm{i}\mathrm{f}\mathrm{f}} $	客户地的多次历史运输中钢厂与客户地之间的路径距离 (km)
	客户收货时间	$ {e}_{\mathrm{t}\mathrm{i}\mathrm{m}\mathrm{e}} $	客户开始收货的时间 (从轨迹数据中提取) , 取值范围为[0,23]
	平均卸货时长	$ {e}_{\mathrm{d}\mathrm{u}\mathrm{r}\mathrm{a}} $	到达该客户地之后所需的卸货时长 (车辆轨迹中提取历史运输中在该客户地附近的平均等待时长)
货物特征	货物品种	$ {m}_{\mathrm{c}\mathrm{a}\mathrm{t}\mathrm{e}} $	运输的货物品种
货物特征	是否卷类	$ {m}_{\mathrm{r}\mathrm{o}\mathrm{l}\mathrm{l}} $	运输的货物是否为钢卷, 取值范围{0,1}, 其中0表示否, 1表示是
时间特征	出发时间	$ {t}_{\mathrm{s}\mathrm{t}\mathrm{a}\mathrm{r}\mathrm{t}} $	运输开始小时, 取值范围为[0,23]
	出发日期	$ {d}_{\mathrm{s}\mathrm{t}\mathrm{a}\mathrm{r}\mathrm{t}} $	运输开始日期, 取值范围为[1,31]
	车辆历史平均返程时间	$ {t}_{\mathrm{r}\mathrm{e}\mathrm{t}\mathrm{u}\mathrm{r}\mathrm{n}} $	承运车辆历史运输中返程所需的平均时长
	车辆历史平均运输周期	$ {t}_{\mathrm{p}\mathrm{e}\mathrm{r}\mathrm{i}\mathrm{o}\mathrm{d}} $	承运车辆历史运输中平均运输周期 (两次连续运输之间的时间间隔)
环境特征	天气	$ w $	出发时间后3 d内是否下雨, 取值范围{0,1}, 其中0表示否, 1表示是

表1

图6

图7

图8

图11

图9

图10

表2

图12

参考文献 24

1	ZHOU W, YANG Y, ZHANG Y, et al. Deep flexible structure spatial-temporal model for taxi capacity prediction. Knowledge-Based Systems, 2020, 205, 106286.
2	WONG R C P, SZETO W Y, WONG S C. A two-stage approach to modeling vacant taxi movements. Transportation Research Procedia, 2015, (7): 147- 163.
3	JINDAL I, QIN Z W, CHEN X W, et al. A unified neural network approach for estimating travel time and distance for a taxi trip [EB/OL]. (2017-10-12)[2022-06-22]. https://arxiv.org/pdf/1710.04350.pdf
4	LI Y G, FU K, WANG Z, et al. Multi-task representation learning for travel time estimation [C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2018: 1695-1704.
5	WANG H J, TANG X F, KUO Y H, et al. A simple baseline for travel time estimation using large-scale trip data. ACM Transactions on Intelligent Systems and Technology (TIST), 2019, 10 (2): 19.
6	YANG Z C, YANG D Y, DYER C, et al. Hierarchical attention networks for document classification [C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016: 1480-1489.
7	HOCHREITER S, SCHMIDHUBER J. Long short-term memory. Neural Computation, 1997, 9 (8): 1735- 1780.
8	KAMARIANAKIS Y, PRASTACOS P. Space-time modeling of traffic flow. Computers and Geosciences, 2005, 31 (2): 119- 133.
9	CASTRO-NETO M, JEONG Y S, JEONG M K, et al. Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions. Expert Systems with Applications, 2009, 36 (3): 6164- 6173.
10	LESHEM G, RITOV Y. Traffic flow prediction using adaboost algorithm with random forests as a weak learner [J]. International Journal of Electrical and Computer Engineering, 2007, 2(2): 111-116.
11	ZHANG J B, ZHENG Y, QI D K. Deep spatio-temporal residual networks for citywide crowd flows prediction [C]// 31st AAAI Conference on Artificial Intelligence. 2017: 1655-1661.
12	WANG D, CAO W, LI J, et al. DeepSD: Supply-demand prediction for online car-hailing services using deep neural networks [C]// 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 2017: 243-254. DOI: 10.1109/ICDE.2017.83.
13	KE J T, YANG H, ZHENG H Y, et al. Hexagon-based convolutional neural network for supply-demand forecasting of ride-sourcing services. IEEE Transactions on Intelligent Transportation Systems, 2019, 20 (11): 4160- 4173.
14	JENELIUS E, KOUTSOPOULOS H N. Travel time estimation for urban road networks using low frequency probe vehicle data. Transportation Research Part B: Methodological, 2013, 53, 64- 81.
15	WANG Z, FU K, YE J P. Learning to estimate the travel time [C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2018: 858-866.
16	YAN S, CHEN X, HUO R, et al. Learning to build user-tag profile in recommendation system [C]// Proceedings of the 29th ACM International Conference on Information & Knowledge Management. ACM, 2020: 2877-2884.
17	BENGIO Y, SIMARD P, FRASCONI P. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 1994, 5 (2): 157- 166.
18	WU C H, HO J M, LEE D T. Travel-time prediction with support vector regression. IEEE Transactions on Intelligent Transportation Systems, 2004, 5 (4): 276- 281.
19	BREIMAN L. Random forests. Machine Learning, 2001, 45 (1): 5- 32.
20	WANG D, ZHANG J B, CAO W, et al. When will you arrive? Estimating travel time based on deep neural networks [C]// Proceedings of the AAAI Conference on Artificial Intelligence. 2018: 2500-2507.
21	GUO G D, WANG H, BELL D, et al. kNN model-based approach in classification [C]// On the Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE, OTM 2003, Lecture Notes in Computer Science, vol 2888. Berlin: Springer, 2003: 986-996.
22	朱军, 胡文波. 贝叶斯机器学习前沿进展综述. 计算机研究与发展, 2015, 52 (1): 16- 26.
23	FRIEDMAN J H. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 2001, 1189- 1232.
24	王黎明, 王连, 杨楠. 应用时间序列分析 [M]. 上海: 复旦大学出版社, 2008.

评价指标	无车辆特征	无货物特征	无终点特征	无时间特征	无环境特征	全量特征
${A}_{ }$	0.8247	0.753 0	0.7518	0.7353	0.7529	0.7765
${P}_{ }$	0.4413	0.3711	0.3661	0.331 0	0.4099	0.4002
${R}_{ }$	0.1718	0.329 0	0.4375	0.4591	0.3888	0.4814
${F_1}{\text{-}}{\rm{score} }$	0.2473	0.3487	0.3986	0.3846	0.3990	0.4371

[1]	张枨宇, 诸嘉逸, 黄怿豪, 杨迪, 李建文, 缪炜恺, 阎迪, 顾斌, 詹乃军, 蒲戈光. 一种基于机器学习的模型检查算法性能预测方法[J]. 华东师范大学学报（自然科学版）, 2024, 2024(4): 18-29.
[2]	陈杰, 沈文怡, 吴问宇, 毛嘉莉. 面向骑行地图推断的轨迹数据质量提升方法[J]. 华东师范大学学报（自然科学版）, 2023, 2023(6): 14-27.
[3]	武朝阳, 毛嘉莉. 基于神经网络的行驶时长预测[J]. 华东师范大学学报（自然科学版）, 2023, 2023(2): 106-118.
[4]	郁毅明, 洪语晨, 王晔, 董启文. 化工材料配方的实验数据治理模块设计[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 1-13.
[5]	李继玲, 李宝林, 严宋如. 疫情背景下快递物流服务的用户行为画像及主题挖掘研究[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 100-114.
[6]	潘晓, 鹿冬娜, 王书海. 基于订单拆分的容量限制商超配送路径规划[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 147-164.
[7]	邹韬, 钱荣涛, 毛嘉莉. 基于钢铁物流数据的索引与查询技术研究[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 195-207.
[8]	孙晴, 梁冠宇, 武延军, 武斌, 田春岐, 王伟. 数据驱动的开源软件供应链可维护性风险分析方法[J]. 华东师范大学学报（自然科学版）, 2022, 2022(5): 90-99.
[9]	龚鑫, 徐立华, 窦亮, 赵瑞祥. 金融科技软件自动化测试用例的冗余评价和削减方法[J]. 华东师范大学学报（自然科学版）, 2022, 2022(4): 43-55.
[10]	于萍, 胡卉芪, 钱卫宁. 基于遗传算法的多目标货物配载研究[J]. 华东师范大学学报（自然科学版）, 2021, 2021(5): 185-198.
[11]	马晓琴, 郭小鹤, 薛峪峰, 杨琳, 陈远哲. 针对命名实体识别的数据增强技术[J]. 华东师范大学学报（自然科学版）, 2021, 2021(5): 14-23.
[12]	纪宇, 何一璇, 吴国群, 吴敏. 基于Prony-like方法的第一类贝塞尔函数逼近[J]. 华东师范大学学报(自然科学版), 2019, 2019(6): 42-60.
[13]	谢青成, 毛嘉莉, 刘婷. 城市共享单车的动态调度策略[J]. 华东师范大学学报(自然科学版), 2019, 2019(6): 88-102.
[14]	申航杰, 琚生根, 孙界平. 基于模糊聚类和支持向量回归的成绩预测[J]. 华东师范大学学报(自然科学版), 2019, 2019(5): 66-73,84.
[15]	江群, 戴戈南, 张森, 葛又铭, 刘玉葆. 基于用户偏好的最优路径搜索[J]. 华东师范大学学报(自然科学版), 2019, 2019(5): 100-112.