Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Logical Reasoning on Shuffled Objects

89.9Accuracy

In-Writing-BF

19.737.92556.1574.375Jan 12, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.01
89.9--
2026.01
88.7--
2026.01
88.7--
2026.01
88.3--
2026.01
87.6--
2026.01
86.7--
2026.01
84.8--
2026.01
84--
2026.01
82.6--
2026.01
48.3--
2026.01
46.7--
2026.01
46.4--
2026.01
45.8--
2026.01
39--
2026.01
38.2--
2026.01
34.8--
2026.01
27--
2026.01
24.3--
2026.01
22.4--
2022.05
-31.329.7
2022.05
-52.452.9