Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Embodied Question Answering on OpenEQA EM-EQA Episodes up to 32 frames
Loading...
86.8
LLM-Match Score
Human
34.488
48.069
61.65
75.231
Oct 1, 2025
LLM-Match Score
Updated 25d ago
Evaluation Results
Method
Method
Links
LLM-Match Score
Human
2025.10
86.8
Gemini 3 Flash
Query Latency [sec]=10.5
2025.10
76.8
Qwen 3.5 Plus
2025.10
74.1
Gemini 2.5 Flash
Query Latency [sec]=6.8
2025.10
69.8
VL-KnG (GER-L)
Query Latency [sec]=0.8
2025.10
55.2
VL-KnG (GER-G)
Query Latency [sec]=0.8
2025.10
54.7
VL-KnG (GR)
Query Latency [sec]=0.8
2025.10
50.7
GPT-4V
2025.10
49.6
Gemini 1.0 Pro Vision
2025.10
44.9
GPT-4 w/ ConceptGraphs
2025.10
36.5
Feedback
Search any
task
Search any
task