General VQA on HallusionBench
[Chart: Accuracy over time, Feb 4, 2026 to Mar 18, 2026. Top result: Gemini 3-Pro, 73.48 accuracy.]
Updated 1mo ago
Evaluation Results
| Method | Variant | Date | Accuracy |
|---|---|---|---|
| Gemini 3-Pro | | 2026.02 | 73.48 |
| GPT-5 | tier=High | 2026.02 | 66.58 |
| Qwen3-VL | mode=Thinking | 2026.02 | 64.01 |
| ERNIE 5.0 | | 2026.02 | 63.87 |
| Gemini 2.5-Pro | | 2026.02 | 63.7 |
| Qwen3-VL | Language Backbone=Qwen... | 2026.03 | 51.89 |
| Intern3.5-VL | Language Backbone=Qwen... | 2026.03 | 48.18 |
| FineViT-VL | Language Backbone=Qwen... | 2026.03 | 46.54 |
| Aquila-VL | Language Backbone=Qwen... | 2026.03 | 42.07 |