Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
State Transition Graph on STG Large (IID)
Loading...
97.25
Accuracy
Llama-3.2-1B
87.89
90.32
92.75
95.18
Nov 27, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Llama-3.2-1B
Architecture=AR
2025.11
97.25
Llama-3.1-8B
Architecture=AR
2025.11
96
Qwen2.5-1.5B
Architecture=AR
2025.11
95.75
C2DLM
Architecture=DLM
2025.11
93.5
LLaDA-8B-Instruct
Architecture=DLM, Trai...
2025.11
88.25
Feedback
Search any
task
Search any
task