Student Misconception Analysis on Student_1361 Python_192
Best result: 20.3 Accuracy, MCTS-guided reasoning reconstruction framework (Aug 15, 2025)
[Chart: Accuracy by date; selectable metrics: Accuracy, Plausibility, Coherence]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Accuracy | Plausibility | Coherence |
|---|---|---|---|---|---|
| MCTS-guided reasoning reconstruction framework | Backbone=GPT-3.5-Turbo... | 2025.08 | 20.3 | 2.69 | 2.39 |
| CoT | Backbone=GPT-3.5-Turbo | 2025.08 | 17.5 | 2.45 | 2.29 |