Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RoboVQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Robotic Video Question AnsweringRoboVQA
Score64.6
14
Embodied Question AnsweringRoboVQA
BLEU-177.1
13
Temporal Task PlanningRoboVQA
Score74.5
11
Long-horizon reasoning for robotic manipulationRoboVQA
B-1 Score70.1
10
Robotic Video Question AnsweringRoboVQA (test)
BLEU-172.7
6
Embodied QARoboVQA (test)
BLEU-177.1
5
Robotic ReasoningRoboVQA (val)
BLEU-442.8
4
Showing 7 of 7 rows