Reward Maximization on SHP
[Chart: Win Rate by date. Plotted point: AISP, Win Rate 0.53, Oct 30, 2025.]
Evaluation Results
| Method | Settings | Date | Win Rate |
|---|---|---|---|
| AISP | Base Model=Gemma3 4B,... | 2025.10 | 0.53 |
| AISP | Base Model=Gemma3 4B,... | 2025.10 | 0.527 |
| AISP | Base Model=Llama3 8B,... | 2025.10 | 0.513 |
| AISP | Base Model=Llama3 8B,... | 2025.10 | 0.47 |
| BoN (top-p) | Base Model=Llama3 8B,... | 2025.10 | 0.453 |
| BoN (top-p) | Base Model=Llama3 8B,... | 2025.10 | 0.42 |
| BoN (top-p) | Base Model=Gemma3 4B,... | 2025.10 | 0.413 |
| BoN (top-p) | Base Model=Gemma3 4B,... | 2025.10 | 0.39 |
| AISP | Base Model=Vicuna 7B,... | 2025.10 | 0.36 |
| AISP | Base Model=Vicuna 7B,... | 2025.10 | 0.353 |
| BoN (top-p) | Base Model=Vicuna 7B,... | 2025.10 | 0.343 |
| BoN (top-p) | Base Model=Vicuna 7B,... | 2025.10 | 0.28 |