Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-Generation Vision-Language Tasks on LLaVA-Med open-generation clients
Loading...
9.36
Average Judge Score
MoR
6.084
6.9345
7.785
8.6355
May 5, 2026
Average Judge Score
Win Rate
Visual Faithfulness
Updated 28d ago
Evaluation Results
Method
Method
Links
Average Judge Score
Win Rate
Visual Faithfulness
MoR
2026.05
9.36
98
9.25
Plural11m-alpha
2026.05
9.12
97.96
9.03
Fedavg
2026.05
6.21
85.52
5.7
Feedback
Search any
task
Search any
task