Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
State Transition Graph on STG Small (IID)
Loading...
90.5
Accuracy
Llama-3.1-8B
81.14
83.57
86
88.43
Nov 27, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Llama-3.1-8B
Architecture=AR
2025.11
90.5
C2DLM
Architecture=DLM
2025.11
88
LLaDA-8B-Instruct
Architecture=DLM, Trai...
2025.11
86.5
Llama-3.2-1B
Architecture=AR
2025.11
83.5
Qwen2.5-1.5B
Architecture=AR
2025.11
81.5
Feedback
Search any
task
Search any
task