Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Language Evaluation on Aggregated Benchmarks

0.7449Average Score

Qwen3-14B + NGM

0.3409640.4458320.55070.655568May 16, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.05
0.74490.0072
2026.05
0.7377-
2026.05
0.72170.0081
2026.05
0.7135-
2026.05
0.64110.0058
2026.05
0.6353-
2026.05
0.55160.0048
2026.05
0.5468-
2026.05
0.36860.0121
2026.05
0.3565-