Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Temporal Reasoning on EgoAVU-Bench

67.84Accuracy

EgoAVU-Instruct (Full)

24.78435.96247.1458.318Feb 5, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
67.84
2026.02
64.31
2026.02
53.2
2026.02
46.4
2026.02
45.04
2026.02
41.22
2026.02
39.85
2026.02
37
2026.02
26.44