
Language Modeling on General Capability

0.685 Capability Score

IT+RL

[Chart: Capability Score (y-axis, 0.29812 to 0.59944) by date (x-axis, Dec 29, 2025)]
Updated 4d ago

Evaluation Results

Method  Links  Date     Capability Score
               2025.12  0.685
               2025.12  0.674
               2025.12  0.67
               2025.12  0.67
               2025.12  0.652
               2025.12  0.57
               2025.12  0.545
               2025.12  0.495
               2025.12  0.431
               2025.12  0.411
               2025.12  0.404
               2025.12  0.392
               2025.12  0.39
               2025.12  0.366
               2025.12  0.365
               2025.12  0.362
               2025.12  0.35
               2025.12  0.34
               2025.12  0.339
               2025.12  0.321
               2025.12  0.313