Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on OBQA
Loading...
30
Accuracy
Self-Improving Pretraining
26.464
27.382
28.3
29.218
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Self-Improving Pretraining
Training Dataset=SlimP...
2026.01
30
Self-Improving Pretraining
Training Dataset=SlimP...
2026.01
29
Self-Improving Pretraining
Training Dataset=RedPa...
2026.01
27.4
Llama Base
Training Dataset=Origi...
2026.01
27.2
Llama Pretrain Baseline
Training Dataset=SlimP...
2026.01
27
Llama Pretrain Baseline
Training Dataset=RedPa...
2026.01
26.6
Feedback
Search any
task
Search any
task