Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Group-level Distractor Generation on Eedi Elementary Math 100
Loading...
31.66
Recall
DiVERT
20.0328
23.0514
26.07
29.0886
Aug 15, 2025
Recall
Updated 1mo ago
Evaluation Results
Method
Method
Links
Recall
DiVERT
2025.08
31.66
Deepseek-V3
Framework=MCTS-guided...
2025.08
31.1
Claude-4-Sonnet
Framework=MCTS-guided...
2025.08
28.31
GPT-4o
Framework=MCTS-guided...
2025.08
27.37
O&A
2025.08
26.63
GPT-3.5-turbo
Framework=MCTS-guided...
2025.08
24.39
LLaMA-3-70B
Framework=MCTS-guided...
2025.08
22.16
D-GEN
2025.08
20.48
Feedback
Search any
task
Search any
task