Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning on Reasoning Suite Average

72.8Accuracy

GHG-TDA

-2.42912817.10151136.6321556.162789Jun 4, 2025Jul 15, 2025Aug 26, 2025Oct 7, 2025Nov 18, 2025Dec 30, 2025Feb 10, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
72.8
2026.02
72.4
2026.02
71.3
2026.02
71
2026.02
69.2
2026.02
68.9
2026.02
68.7
2026.02
68.5
2026.02
64.9
2026.02
64.4
2025.06
0.5729
2025.06
0.5727
2025.06
0.5652
2025.06
0.5626
2025.06
0.5612
2025.06
0.5608
2025.06
0.5602
2025.06
0.5577
2025.06
0.5553
2025.06
0.5529
2025.06
0.5512
2025.06
0.5495
2025.06
0.5489
2025.06
0.5427
2025.06
0.5418
2025.06
0.5371
2025.06
0.4955
2025.06
0.4937
2025.06
0.4874
2025.06
0.4859
2025.06
0.4842
2025.06
0.4791
2025.06
0.4756
2025.06
0.475
2025.06
0.4703
2025.06
0.4643