Dialogue reasoning on IndiRef
[Chart: Temporal Accuracy by method. Highlighted point: Agentic-Image, 50 (Apr 22, 2026). Available metrics: Temporal Accuracy, Spatial Accuracy, Attributive Accuracy, Inferred Accuracy.]
Updated 1mo ago
Evaluation Results
| Method | Method details | Date | Temporal Accuracy | Spatial Accuracy | Attributive Accuracy | Inferred Accuracy |
|---|---|---|---|---|---|---|
| Agentic-Image | | 2026.04 | 50 | 24 | 44 | 58 |
| Agentic-Text | | 2026.04 | 42 | 26 | 44 | 46 |
| Qwen-QwQ | Framework=Full Dialog... | 2026.04 | 32 | 38 | 40 | 40 |
| Qwen3-VL-Thinking | Framework=Full Dialog... | 2026.04 | 18 | 10 | 20 | 24 |