Reasoning on MetaBench Generalization

Accuracy: 29.55

Content-SharpRouter

[Chart: accuracy by date; y-axis ticks 25.4836, 26.5393, 27.595, 28.6507; x-axis label Mar 31, 2026]
Updated 18d ago

Evaluation Results

Date      Accuracy
2026.03   29.55
2026.03   29.38
2026.03   28.98
2026.03   28.67
2026.03   27.7
2026.03   26.97
2026.03   26.79
2026.03   25.64