Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on ANLI R3
Loading...
48.25
Accuracy
GANPO
47.73
47.865
48
48.135
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GANPO
Backbone Model=Gemma2-...
2026.01
48.25
DPO
Backbone Model=Gemma2-...
2026.01
47.92
Base
Backbone Model=Gemma2-...
2026.01
47.75
Feedback
Search any
task
Search any
task