Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Motion Understanding and Temporal Reasoning on Tomato (test)

53Accuracy

GPT-5

17.1226.43535.7545.065Feb 13, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
53
2026.02
48.6
2026.02
48.3
2026.02
39.6
2026.02
28.3
2026.02
27
2026.02
25.5
2026.02
24.9
2026.02
21.7
2026.02
18.5