Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on GSM8K
Loading...
79.1
Accuracy
No Steering
21.068
36.134
51.2
66.266
Dec 7, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
No Steering
Base Model=Llama 3 8B,...
2025.12
79.1
Gradient Cuff
Base Model=Llama 3 8B,...
2025.12
78.2
Prompt guardrails
Base Model=Llama 3 8B,...
2025.12
77.1
SAE steering
Base Model=Llama 3 8B,...
2025.12
76.2
GSAE
Base Model=Llama 3 8B,...
2025.12
74.2
GSAE-1D
Base Model=Llama 3 8B,...
2025.12
74
Input Gate Only
Base Model=Llama 3 8B,...
2025.12
72.1
CAA
Base Model=Llama 3 8B,...
2025.12
67.1
No gating
Base Model=Llama 3 8B,...
2025.12
66.2
SafeSwitch
Base Model=Llama 3 8B,...
2025.12
66.1
Random graphs
Base Model=Llama 3 8B,...
2025.12
23.3
Feedback
Search any
task
Search any
task