Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multiple Choice Answering on VIEW2SPACE
Loading...
93.57
Accuracy (%)
Human-max
25.9908
43.5354
61.08
78.6246
Mar 17, 2026
Accuracy (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy (%)
Human-max
2026.03
93.57
Human-avg
2026.03
89.88
Human-min
2026.03
85
Ours (Grounded CoT)
Model Type=Fine-tuned...
2026.03
64.93
GPT-5
2026.03
59.86
Qwen3-VL-4B
2026.03
35.19
Random (chance)
2026.03
28.59
Random (frequency)
2026.03
28.59
Feedback
Search any
task
Search any
task