
Logical Reasoning on Shuffled Objects

92.1 Accuracy

GateKD

Updated 20d ago

Evaluation Results

Date      Accuracy
2026.05   92.1   -   -
2026.05   90.6   -   -
2026.01   89.9   -   -
2026.05   89.7   -   -
2026.01   88.7   -   -
2026.01   88.7   -   -
2026.01   88.3   -   -
2026.05   88.3   -   -
2026.05   88.1   -   -
2026.01   87.6   -   -
2026.01   86.7   -   -
2026.05   85.1   -   -
2026.05   84.9   -   -
2026.01   84.8   -   -
2026.05   84.3   -   -
2026.01   84     -   -
2026.05   83.7   -   -
2026.05   83.1   -   -
2026.01   82.6   -   -
2026.05   82.6   -   -
2026.05   81     -   -
2026.05   80.8   -   -
2026.05   78.4   -   -
2026.05   76.4   -   -
2026.05   72.9   -   -
2026.05   70     -   -
2026.05   68.2   -   -
2026.05   66.1   -   -
2026.05   63.7   -   -
2026.01   48.3   -   -
2026.01   46.7   -   -
2026.01   46.4   -   -
2026.01   45.8   -   -
2026.01   39     -   -
2026.01   38.2   -   -
2026.01   34.8   -   -
2026.01   27     -   -
2026.01   24.3   -   -
2026.01   22.4   -   -
2022.05   -      31.3   29.7
2022.05   -      52.4   52.9