Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents
About
Large language models (LLMs) are revolutionizing education, with LLM-based agents playing a key role in simulating student behavior. A major challenge in student simulation is modeling the diverse learning patterns of students at various cognitive levels. However, current LLMs, typically trained as ``helpful assistants'', target at generating perfect responses. As a result, they struggle to simulate students with diverse cognitive abilities, as they often produce overly advanced answers, missing the natural imperfections that characterize student learning and resulting in unrealistic simulations. To address this issue, we propose a training-free framework for student simulation. We begin by constructing a cognitive prototype for each student using a knowledge graph, which captures their understanding of concepts from past learning records. This prototype is then mapped to new tasks to predict student performance. Next, we simulate student solutions based on these predictions and iteratively refine them using a beam search method to better replicate realistic mistakes. To validate our approach, we construct the \texttt{Student\_100} dataset, consisting of $100$ students working on Python programming and $5,000$ learning records. Experimental results show that our method consistently outperforms baseline models, achieving $100\%$ improvement in simulation accuracy.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Solution Simulation | Student 100 1.0 (test) | ROUGE-L0.3034 | 72 | |
| Solution Simulation | Student_15 1.0 (test) | ROUGE-L27.37 | 72 | |
| Student behavior and solution simulation | Student_100 1.0 (full) | Accuracy89 | 72 | |
| Student Simulation | C++_5 | Acc86 | 72 | |
| Student Simulation | Java 5 | Accuracy84 | 72 | |
| Behavior Prediction | Student_15 v1.0 (test) | Accuracy94 | 24 | |
| Solution Simulation | Student_15 v1.0 (test) | Con23.65 | 15 | |
| Solution Simulation | Human Evaluation Solution Simulation (test) | Score3.75 | 8 |