Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning on Reasoning Suite Average

74.8Accuracy

RLCM

-2.50912817.56151137.6321557.702789Jun 4, 2025Jul 28, 2025Sep 20, 2025Nov 13, 2025Jan 6, 2026Mar 1, 2026Apr 25, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
74.8
2026.04
73.3
2026.02
72.8
2026.02
72.4
2026.04
72.3
2026.04
71.9
2026.02
71.3
2026.02
71
2026.02
69.2
2026.02
68.9
2026.02
68.7
2026.02
68.5
2026.04
68.3
2026.04
67.5
2026.02
64.9
2026.02
64.4
2026.04
62.9
2026.04
62.7
2026.04
58.9
2025.06
0.5729
2025.06
0.5727
2025.06
0.5652
2025.06
0.5626
2025.06
0.5612
2025.06
0.5608
2025.06
0.5602
2025.06
0.5577
2025.06
0.5553
2025.06
0.5529
2025.06
0.5512
2025.06
0.5495
2025.06
0.5489
2025.06
0.5427
2025.06
0.5418
2025.06
0.5371
2025.06
0.4955
2025.06
0.4937
2025.06
0.4874
2025.06
0.4859
2025.06
0.4842
2025.06
0.4791
2025.06
0.4756
2025.06
0.475
2025.06
0.4703
2025.06
0.4643