Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Research on In-house 1
Loading...
52.5
Accuracy
Gemini-3-Flash
7.78
19.39
31
42.61
Apr 6, 2026
Accuracy
Updated 12d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini-3-Flash
Agent Type=Direct Answer
2026.04
52.5
GPT-5.4
Agent Type=Direct Answer
2026.04
45.1
MIA
Agent Type=Memory-base...
2026.04
31.8
Gemini-2.5-Pro
Agent Type=Direct Answer
2026.04
30.8
Unsupervised MIA
Agent Type=Memory-base...
2026.04
29.8
Qwen2.5-VL-32B+ReACT
Agent Type=Search Agent
2026.04
28.8
GPT-4o
Agent Type=Direct Answer
2026.04
25.6
Memento
Agent Type=Memory-base...
2026.04
22.7
ExpeL
Agent Type=Memory-base...
2026.04
19.7
Qwen2.5-VL-32B
Agent Type=Direct Answer
2026.04
18.6
ReasoningBank
Agent Type=Memory-base...
2026.04
18.6
No Memory
Agent Type=Memory-base...
2026.04
15.9
MMSearch-R1
Agent Type=Search Agent
2026.04
13.6
RAG
Agent Type=Memory-base...
2026.04
12.5
Mem0
Agent Type=Memory-base...
2026.04
12.5
A-Mem
Agent Type=Memory-base...
2026.04
12.5
Qwen2.5-VL-7B
Agent Type=Direct Answer
2026.04
9.5
Qwen2.5-VL-7B+ReACT
Agent Type=Search Agent
2026.04
9.5
Feedback
Search any
task
Search any
task