Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Visual Understanding on VQAv2, NLVR2, MME (held-out)
Loading...
71.54
Accuracy
SKILLRATER
65.6848
67.2049
68.725
70.2451
Feb 12, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
SKILLRATER
Training Steps=100k, M...
2026.02
71.54
SKILLRATER Combined
Training Steps=100k, M...
2026.02
71.36
DataRater
Training Steps=100k, M...
2026.02
68.33
Mammoth (Baseline)
Training Steps=100k, M...
2026.02
65.91
Feedback
Search any
task
Search any
task