Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Motion Understanding and Temporal Reasoning on Tomato (test)

53Accuracy

GPT-5

17.1226.43535.7545.065Feb 13, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
53
2026.02
48.6
2026.02
48.3
2026.02
39.6
2026.02
28.3
2026.02
27
2026.02
25.5
2026.02
24.9
2026.02
21.7
2026.02
18.5