Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

About

Large language models (LLMs) are revolutionizing education, with LLM-based agents playing a key role in simulating student behavior. A major challenge in student simulation is modeling the diverse learning patterns of students at various cognitive levels. However, current LLMs, typically trained as ``helpful assistants'', target at generating perfect responses. As a result, they struggle to simulate students with diverse cognitive abilities, as they often produce overly advanced answers, missing the natural imperfections that characterize student learning and resulting in unrealistic simulations. To address this issue, we propose a training-free framework for student simulation. We begin by constructing a cognitive prototype for each student using a knowledge graph, which captures their understanding of concepts from past learning records. This prototype is then mapped to new tasks to predict student performance. Next, we simulate student solutions based on these predictions and iteratively refine them using a beam search method to better replicate realistic mistakes. To validate our approach, we construct the \texttt{Student\_100} dataset, consisting of $100$ students working on Python programming and $5,000$ learning records. Experimental results show that our method consistently outperforms baseline models, achieving $100\%$ improvement in simulation accuracy.

Tao Wu, Jingyuan Chen, Wang Lin, Mengze Li, Yumeng Zhu, Ang Li, Kun Kuang, Fei Wu• 2025

Related benchmarks

Task	Dataset	Result
Solution Simulation	Student 100 1.0 (test)	ROUGE-L0.3034	72
Solution Simulation	Student_15 1.0 (test)	ROUGE-L27.37	72
Student behavior and solution simulation	Student_100 1.0 (full)	Accuracy89	72
Student Simulation	C++_5	Acc86	72
Student Simulation	Java 5	Accuracy84	72
Personalized distractor generation	Eedi 100 Student 1361	Accuracy25.5	45
Personalized distractor generation	Discrete_40 Student_1361	Accuracy26.5	45
Behavior Prediction	Student_15 v1.0 (test)	Accuracy94	24
Solution Simulation	Student_15 v1.0 (test)	Con23.65	15
Solution Simulation	Human Evaluation Solution Simulation (test)	Score3.75	8

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord