Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ERQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Embodied Reasoning and Question AnsweringERQA
Score65
35
Spatial Reasoning (Multi-Image)ERQA
Accuracy51.02
23
Multimodal ReasoningERQA
Accuracy55
22
Embodied Visual Question AnsweringERQA
Accuracy59
19
Embodied reasoningERQA (test)
Accuracy70.25
12
Embodied ReasoningERQA
Accuracy54.5
11
Embodied ReasoningERQA (train)
Success Rate (SR)61.33
7
GeneralERQA
Score41.6
4
Multimodal UnderstandingErqa
Accuracy51.3
3
Ego-centric Spatial ReasoningERQA
Accuracy36.2
2
Showing 10 of 10 rows