Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Understanding on Evaluation Suite Combined (held-out)
Loading...
48.4
Accuracy
SKILLRATER
44.5312
45.5356
46.54
47.5444
Feb 12, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
SKILLRATER
Training Steps=100k, M...
2026.02
48.4
SKILLRATER Combined
Training Steps=100k, M...
2026.02
47.01
DataRater
Training Steps=100k, M...
2026.02
45.89
Mammoth (Baseline)
Training Steps=100k, M...
2026.02
44.68
Feedback
Search any
task
Search any
task