Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spatial-Temporal Reasoning on MMSI

41.8Accuracy

GPT-5

23.80828.47933.1537.821Feb 28, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
41.8
36.9
2026.02
30.8
2026.02
30.3
2026.02
28.7
2026.02
28.5
2026.02
26.8
2026.02
25.9
2026.02
24.5