Reasoning on CoT-Collection Scenario 1

Accuracy: 70

LaDa

[Chart: accuracy, Feb 21, 2026]
Updated 3d ago

Evaluation Results

Date     Accuracy  Links
2026.02  70        0.2
2026.02  69.8      -
2026.02  69.2      -
2026.02  69        3.4
2026.02  66.2      -
2026.02  65.6      -
2026.02  64.9      -
2026.02  54        2
2026.02  54        1.1
2026.02  52.9      -
2026.02  52.5      -
2026.02  52        -
2026.02  51.3      -
2026.02  51.2      1
2026.02  50.6      -
2026.02  50.2      -
2026.02  49.6      2
2026.02  48.6      -
2026.02  48.6      -
2026.02  47.8      3.8
2026.02  47.6      -
2026.02  47        -
2026.02  45.2      -
2026.02  44.6      -
2026.02  44.6      -
2026.02  44.2      -
2026.02  44        -
2026.02  43.8      -
2026.02  43.6      -
2026.02  43.5      0.8
2026.02  43.2      -
2026.02  42.8      -
2026.02  42.7      -
2026.02  41.9      -
2026.02  41.6      -
2026.02  41.6      -
2026.02  41        -
2026.02  40.8      -
2026.02  40.1      -
2026.02  39.6      -