Human Preference Alignment on MM-AlignBench 1.0 (test)

Top result: Claude3.5V-Sonnet, 84.9 Win Rate

[Chart: Win Rate, data point dated Feb 25, 2025]

Evaluation Results

Win Rate  Reward  Better+  Better  Tie  Worse  Worse+  Date     Method
84.9      51.4    70       144     1    23     14      2025.02  Claude3.5V-Sonnet
82.5      465715712311                                 2025.02
81.3      49      81       124     12   31     4
77        39.1    56       138     14   35     9       2025.02
72.6      33.5    49       138     20   40     5       2025.02
62.3      19.4    31       126     19   62     14
61.5      21.6    43       112     15   75     7
50        0       -        -       -    -      -       2025.02
44.4      -5.8    28       84      5    101    34
44.4      -6.9    19       93      8    98     34
40.1      -10.9   26       75      10   100    41
31.3      -21.8   18       61      15   109    49
27.8      -33.7   18       52      4    98     80
26.6      -29     16       51      10   121    54
23.8      -46.2   14       46      1    75     116
12.7      -53     9        23      8    116    96
7.5       -74     5        14      3    63     167
2.7       -92.3   3        4       0    15     230