Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Open-ended Utility Evaluation on MM-Vet v2 (test)
Loading...
66.3
Utility Score
No Steering
58.188
60.294
62.4
64.506
Apr 10, 2026
Utility Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Utility Score
No Steering
Steering Method=None
2026.04
66.3
DACO
Steering Method=DACO
2026.04
63.7
Prompting
Steering Method=Prompting
2026.04
63
MOP
Steering Method=MOP
2026.04
61.1
ActAdd
Steering Method=Activa...
2026.04
58.5
Feedback
Search any
task
Search any
task