Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Vision-Language Spatial Reasoning on WhatsUp A-LR 2x2 directional variants
Loading...
95.87
GroupScore
TTM
-3.8348
22.0501
47.935
73.8199
Oct 9, 2025
GroupScore
Absolute Gain (Δ)
Relative Gain
Error Reduction
Updated 1mo ago
Evaluation Results
Method
Method
Links
GroupScore
Absolute Gain (Δ)
Relative Gain
Error Reduction
TTM
Protocol=Algorithm 1
2025.10
95.87
55.1
135.1
93
SimpleMatch
Protocol=GroupMatch
2025.10
40.78
-
-
-
CLIP-B32
Type=Raw performance,...
2025.10
0
-
-
-
Feedback
Search any
task
Search any
task