Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Spatio-Temporal Reasoning on V-STaR

60.8Accuracy

GPT-4o

14.62426.61238.650.588Dec 7, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
60.810.416.72.86.51.231012.826.838.2
2025.12
5517.419.920.122.28.29.515.723.430.840.6
2025.12
54.58.911.58.413.61.425.47.62432.4
5315.924.50.94.60.62.215.223.826.935.6
49.56.310.50.21.90.31.35.512.220.827.3
2025.12
44.24.98.700.700.13.87.817.624.9
41.919.8230.10.900.220.423.121.727
2025.12
41.510.917.100.21002.15.9617.722
2025.12
36.213.113.70.12.50.3112.112.51720.3
2025.12
34.319.727.540.337.244.541.716.72533.240.7
2025.12
26.48.712.01----8.513.613.114.8
2025.12
20.54.513.52.210.113.55.614.815.113.8
2025.12
17.614.119.1----1217.11213.3
2025.12
16.400.134.1832.340.437.50017.120.3