Goal Prediction on EgoToM (test)
Best True Rate: 78.2 (Qwen2.5-VL-7B, Mar 25, 2026)
Metrics: True Rate, Info Rate, True AND Info Rate
Evaluation Results

Method               Intervention Setting  Date     True Rate  Info Rate  True AND Info Rate
Qwen2.5-VL-7B        +αΔ                   2026.03  78.2       45.9       35.4
Qwen2.5-VL-7B        B...                  2026.03  76.1       14.2       9.4
Gemini-2.5-Flash     B...                  2026.03  75.5       35.3       20.2
LLaVA-Next-Video-7B  +αΔ                   2026.03  27.3       99.9       27.2
LLaVA-Next-Video-7B  +αΔ                   2026.03  25.9       99.5       25.8
Qwen2.5-VL-7B        +αΔ                   2026.03  24         95.7       22.7
Gemini-2.5-Flash     B...                  2026.03  21.8       100        21.8
Qwen2.5-VL-7B        B...                  2026.03  19.2       98.9       18.6
LLaVA-Next-Video-7B  B...                  2026.03  14.4       99.7       14.4
LLaVA-Next-Video-7B  B...                  2026.03  8.5        100        8.5