华东师范大学学报(自然科学版) ›› 2026, Vol. 2026 ›› Issue (3): 17-29, 43.doi: 10.3969/j.issn.1000-5641.2026.03.002

• 碳循环过程与有机质特征 • 上一篇    下一篇

基于机器学习的长江溶解态有机碳输送的长期变化及驱动因素分析

鲁兴宇, 吴莹*(), 樊俊宁, 叶雨萍, 王佳   

  1. 华东师范大学 河口海岸全国重点实验室, 上海 200241
  • 收稿日期:2024-11-22 接受日期:2025-04-10 出版日期:2026-05-25 发布日期:2026-05-27
  • 通讯作者: 吴莹 E-mail:wuying@sklec.ecnu.edu.cn
  • 基金资助:
    上海市科委科技创新行动计划国际合作项目 (23230714200)

Long-term variations and driving factors of dissolved organic carbon transport in the Yangtze River using machine learning

Xingyu LU, Ying WU*(), Junning FAN, Yuping YE, Jia WANG   

  1. State Key Laboratory of Estuarine and Coastal Research, East China Normal University, Shanghai 200241, China
  • Received:2024-11-22 Accepted:2025-04-10 Online:2026-05-25 Published:2026-05-27
  • Contact: Ying WU E-mail:wuying@sklec.ecnu.edu.cn

摘要:

解析河流溶解态有机碳 (DOC) 浓度及通量变化对于刻画全球碳循环过程和细化全球碳收支计算至关重要. 然而, 中国河流的相关研究受“长时间连续观测样本少”和“控制参数时间分布不统一”等限制, 河流DOC的季节和长期变化规律及其驱动因素的影响机制尚未得到充分认识. 该研究利用长江徐六泾站长时间尺度的逐月DOC浓度实测数据和相关流域特征数据, 对比不同机器学习方法, 基于最佳模型模拟了2001—2020年徐六泾站逐月的DOC浓度变化; 并使用SHAP (SHapley Additive exPlanations) 方法分析了流域特征对DOC浓度和通量的影响. 结果表明, 随机森林算法是最佳选择 (R2=0.72, RMSE=0.09 mg·L−1). 2001—2020年间, 长江徐六泾站DOC浓度范围为1.24~2.27 mg·L−1, 平均为1.67 mg·L−1; 年DOC通量变化范围为0.93~2.41 Tg·a−1, 平均为1.46 Tg·a−1. 研究表明, 徐六泾站DOC浓度的季节性模式已从洪季低、枯季高转变为洪季和枯季均较高, 这种变化可以归因于人为水调节活动和流域生态系统格局的变化. 在长期趋势上, 徐六泾站的DOC浓度和通量均显著上升, 递增速率分别为0.026 mg·L−1·a−1, p<0.05; 0.0025 Tg·a−1, p<0.05. 其中在近十年尺度上, 人类活动的增强解释了54.1%的DOC浓度变化, 是最主要的驱动因素. 本研究将有助于进一步了解过去十几年间长江DOC浓度和通量的变化规律及其驱动因素的影响机制, 并为河流碳循环的大数据分析研究提供新的视角.

关键词: 长江, 溶解态有机碳, 机器学习, 驱动因素

Abstract:

Analyzing variations in riverine dissolved organic carbon (DOC) concentration and flux is essential for understanding global carbon cycle processes and refining carbon budget estimations. Machine learning and big data analysis have become invaluable in this field. However, research on Chinese rivers is limited due to lack of long-term continuous observational data and non-uniform temporal distribution of key influencing factors. Consequently, mechanisms driving seasonal and long-term variations in riverine DOC and their influencing factors remain unclear. This study compared various machine learning methods using long-term, monthly DOC concentration data from the Yangtze River’s Xuliujing Station, as well as watershed characteristic data. Using the optimal model, we simulated monthly DOC concentration changes at Xuliujing Station from 2001 to 2020 and employed the SHAP (SHapley Additive exPlanations) method to analyze the impact of watershed characteristics on DOC concentration and flux. The findings demonstrated that the Random Forest algorithm yielded the highest accuracy, achieving an R² of 0.72 and an RMSE of 0.09 mg·L−1. Over the study period, DOC concentrations ranged from 1.24 to 2.27 mg·L−1, with a mean of 1.67 mg·L−1. Annual DOC flux varied between 0.93 and 2.41 Tg·a−1, averaging 1.46 Tg·a−1. Notably, the seasonal pattern of DOC concentration shifted from low during the flood season and high during the dry season to high levels in both seasons. This shift was primarily due to anthropogenic water regulation activities and changes in watershed ecosystem patterns. Over the long term, both DOC concentration and flux at Xuliujing Station have significantly increased, at rates of 0.026 mg·L−1·a−1 and 0.0025 Tg·a−1, respectively (both p<0.05). Human activities were the predominant driving factor, accounting for 54.1% of the changes in DOC concentration. This study provides valuable insights into the evolving patterns of DOC concentration and flux in the Yangtze River over recent decades and the mechanisms by which driving factors influence these changes. It also provides a novel perspective for the big data analysis of riverine carbon cycling.

Key words: Yangtze River, dissolved organic carbon, machine learning, driving factors

中图分类号: