Fact-based Question Answering on FVQA (test)
[Chart: Accuracy by date. Top result: MM-DeepResearch 32B, 70.1 Accuracy, Mar 1, 2026. Updated 1mo ago.]
Evaluation Results

Method                 Evaluation Paradigm   Date      Accuracy
MM-DeepResearch 32B    Ag...                 2026.03   70.1
MM-DeepResearch-8B     Ag...                 2026.03   69.2
SenseNova-MARS-8B      Ag...                 2026.03   67.1
GPT-4o                 RA...                 2026.03   66.3
GPT-5                  RA...                 2026.03   62.6
MM-DeepResearch-7B     Ag...                 2026.03   61.9
DeepEyes-v2-7B         Ag...                 2026.03   60.6
Qwen3-VL-32B           Ag...                 2026.03   60.2
Qwen3-VL-8B            Ag...                 2026.03   58.7
MMSearch-R1-7B         Ag...                 2026.03   58.4
GPT-5                  Di...                 2026.03   54.4
Qwen3-VL-8B            RA...                 2026.03   53.6
GPT-4o                 Di...                 2026.03   48
Visual-ARFT-7B         Ag...                 2026.03   41.7
Qwen3-VL-32B           Di...                 2026.03   34.1
Qwen3-VL-8B            Di...                 2026.03   24.2