Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Episodic-memory Question Answering on OpenEQA v1 (HM3D)
Loading...
85.1
LLM Match Score
Human baseline
21.764
38.207
54.65
71.093
Dec 17, 2025
LLM Match Score
Updated 4d ago
Evaluation Results
Method
Method
Links
LLM Match Score
Human baseline
2025.12
85.1
R4
2025.12
76.96
GPT-4V
2025.12
46.6
AlanaVLM
2025.12
44.8
GPT-4 w/ LLaVA-1.5
Vision-Language Model=...
2025.12
40
GPT-4
2025.12
35.5
GPT-4 w/ SVM
Representation=Sparse...
2025.12
35
GPT-4 w/ CG
Representation=Concept...
2025.12
34
LLaMA-2 w/ LLaVA-1.5
Vision-Language Model=...
2025.12
31.1
LLaMA-2 w/ SVM
Representation=Sparse...
2025.12
30.9
LLaMA-2
2025.12
29
LLaMA-2 w/ CG
Representation=Concept...
2025.12
24.2
Feedback
Search any
task
Search any
task