Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Captioning on RoboFine-Bench Hard setting
Loading...
83.6
Overall Score
RoboFine-VLM
64.36
69.355
74.35
79.345
May 26, 2026
Overall Score
Consistency Score
Coverage Score
Anti-Hallucination Score
Updated 7d ago
Evaluation Results
Method
Method
Links
Overall Score
Consistency Score
Coverage Score
Anti-Hallucination Score
RoboFine-VLM
2026.05
83.6
81.9
75.3
93.7
GPT-5.4
2026.05
78.1
74.2
68.9
91.1
Gemini-3.1-Pro
2026.05
77.2
77
61.3
93.4
Qwen3.5-Plus
2026.05
72.5
70.9
56.8
89.7
Doubao-Seed-2.0-Pro
2026.05
68.2
72.2
65.6
66.8
Qwen3-VL-Plus
2026.05
65.1
68.7
57
69.6
Feedback
Search any
task
Search any
task