

Research on a knowledge tracking model based on the stacked gated recurrent unit residual network

  • Caidie HUANG ,
  • Xinping WANG ,
  • Liangyu CHEN ,
  • Yong LIU
  • 1. Software Engineering Institute, East China Normal University, Shanghai 200062, China
    2. Basic Education and Lifelong Education Development Department, East China Normal University, Shanghai 200062, China

Received date: 2021-08-10

  Online published: 2022-11-22


Cite this article

HUANG Caidie, WANG Xinping, CHEN Liangyu, LIU Yong. Research on a knowledge tracking model based on the stacked gated recurrent unit residual network[J]. Journal of East China Normal University (Natural Science), 2022, 2022(6): 68-78. DOI: 10.3969/j.issn.1000-5641.2022.06.008

Abstract

Knowledge tracking is the task of tracking changes in a student’s knowledge level based on the student’s historical answer records and other auxiliary information, and of predicting the student’s response at the next time step. Since the accuracy and performance of existing neural-network knowledge tracking models still leave room for improvement, this paper proposes a deep residual network built on stacked gated recurrent units (GRUs), named the stacked-gated recurrent unit-residual (S-GRU-R) network. To address the over-fitting caused by the large number of parameters in a long short-term memory (LSTM) network, the model uses GRUs instead of LSTM to learn from the answer sequence; stacking GRU layers enlarges the sequence-learning capacity, and residual connections reduce the difficulty of training. S-GRU-R was evaluated on the Statics2011 dataset using AUC (area under the curve) and F1-score as evaluation metrics. The results show that S-GRU-R surpasses other comparable recurrent neural network models on both metrics.
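As a rough illustration of the architecture the abstract describes, the following PyTorch sketch stacks several GRU layers, wraps each one in a residual connection, and predicts the probability of a correct answer at the next step. This is not the authors’ published implementation: the module name `StackedGRUResidual`, the layer sizes, the dropout rate, and the DKT-style one-hot input encoding are all illustrative assumptions.

```python
# Hypothetical sketch of an S-GRU-R-style network in PyTorch.
# Layer sizes, input encoding, and names are assumptions, not the paper's code.
import torch
import torch.nn as nn


class StackedGRUResidual(nn.Module):
    def __init__(self, num_questions: int, hidden_size: int = 128, num_layers: int = 3):
        super().__init__()
        # One-hot (question, correctness) pairs -> dense input, as in DKT-style models.
        self.input_proj = nn.Linear(2 * num_questions, hidden_size)
        # Several single-layer GRUs so a residual (skip) connection can wrap each one.
        self.grus = nn.ModuleList(
            nn.GRU(hidden_size, hidden_size, batch_first=True) for _ in range(num_layers)
        )
        self.dropout = nn.Dropout(0.2)
        # Predict the probability of answering each question correctly at the next step.
        self.output = nn.Linear(hidden_size, num_questions)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, 2 * num_questions) encoding of past (question, answer) pairs
        h = self.input_proj(x)
        for gru in self.grus:
            out, _ = gru(h)
            h = h + self.dropout(out)  # residual connection around each GRU layer
        return torch.sigmoid(self.output(h))


if __name__ == "__main__":
    model = StackedGRUResidual(num_questions=100)
    batch = torch.zeros(4, 50, 200)   # 4 students, 50 interactions each
    probs = model(batch)              # (4, 50, 100) next-step correctness probabilities
    print(probs.shape)
```

Training such a model would typically minimize binary cross-entropy between the predicted probabilities and the observed correctness labels, with AUC and F1-score computed on held-out interactions, mirroring the evaluation metrics named in the abstract.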
