Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Vision-Language Spatial Reasoning on WhatsUp A-OU 2x2 directional variants
Loading...
99.03
GroupScore
TTM
0.074
25.7645
51.455
77.1455
Oct 9, 2025
GroupScore
Absolute Gain
Relative Gain
Error Reduction
Updated 1mo ago
Evaluation Results
Method
Method
Links
GroupScore
Absolute Gain
Relative Gain
Error Reduction
TTM
Protocol=Algorithm 1
2025.10
99.03
20.4
25.9
95.5
SimpleMatch
Protocol=GroupMatch
2025.10
78.64
-
-
-
CLIP-B32
Type=Raw performance,...
2025.10
3.88
-
-
-
Feedback
Search any
task
Search any
task