
WildVision

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Real-world Understanding | WildVision | Win Rate: 80.6 | 17 |
| Human Preferences | WildVision 0617 | Score: 89.4 | 14 |
| Pointwise Scoring | WildVision (pointwise) | Kendall's Tau: 0.949 | 9 |
| Multi-modal preference alignment | WildVision | Winning Rate: 40.2 | 6 |
| Multi-modal Chat | WildVision 0617 (test) | General Score: 89.2 | 4 |