Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

About

Large language models (LLMs) are revolutionizing education, with LLM-based agents playing a key role in simulating student behavior. A major challenge in student simulation is modeling the diverse learning patterns of students at various cognitive levels. However, current LLMs, typically trained as ``helpful assistants'', target at generating perfect responses. As a result, they struggle to simulate students with diverse cognitive abilities, as they often produce overly advanced answers, missing the natural imperfections that characterize student learning and resulting in unrealistic simulations. To address this issue, we propose a training-free framework for student simulation. We begin by constructing a cognitive prototype for each student using a knowledge graph, which captures their understanding of concepts from past learning records. This prototype is then mapped to new tasks to predict student performance. Next, we simulate student solutions based on these predictions and iteratively refine them using a beam search method to better replicate realistic mistakes. To validate our approach, we construct the \texttt{Student\_100} dataset, consisting of $100$ students working on Python programming and $5,000$ learning records. Experimental results show that our method consistently outperforms baseline models, achieving $100\%$ improvement in simulation accuracy.

Tao Wu, Jingyuan Chen, Wang Lin, Mengze Li, Yumeng Zhu, Ang Li, Kun Kuang, Fei Wu• 2025

Related benchmarks

TaskDatasetResultRank
Solution SimulationStudent 100 1.0 (test)
ROUGE-L0.3034
72
Solution SimulationStudent_15 1.0 (test)
ROUGE-L27.37
72
Student behavior and solution simulationStudent_100 1.0 (full)
Accuracy89
72
Student SimulationC++_5
Acc86
72
Student SimulationJava 5
Accuracy84
72
Behavior PredictionStudent_15 v1.0 (test)
Accuracy94
24
Solution SimulationStudent_15 v1.0 (test)
Con23.65
15
Solution SimulationHuman Evaluation Solution Simulation (test)
Score3.75
8
Showing 8 of 8 rows

Other info

Follow for update