Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Language Evaluation on Aggregated Benchmarks
Loading...
0.7449
Average Score
Qwen3-14B + NGM
0.340964
0.445832
0.5507
0.655568
May 16, 2026
Average Score
Average Improvement
Updated 15d ago
Evaluation Results
Method
Method
Links
Average Score
Average Improvement
Qwen3-14B + NGM
Model Scale=14B, NGM C...
2026.05
0.7449
0.0072
Qwen3-14B
Model Scale=14B, NGM C...
2026.05
0.7377
-
Qwen3-8B + NGM
Model Scale=8B, NGM Co...
2026.05
0.7217
0.0081
Qwen3-8B
Model Scale=8B, NGM Co...
2026.05
0.7135
-
Qwen3-4B + NGM
Model Scale=4B, NGM Co...
2026.05
0.6411
0.0058
Qwen3-4B
Model Scale=4B, NGM Co...
2026.05
0.6353
-
Qwen3-1.7B + NGM
Model Scale=1.7B, NGM...
2026.05
0.5516
0.0048
Qwen3-1.7B
Model Scale=1.7B, NGM...
2026.05
0.5468
-
Qwen3-0.6B + NGM
Model Scale=0.6B, NGM...
2026.05
0.3686
0.0121
Qwen3-0.6B
Model Scale=0.6B, NGM...
2026.05
0.3565
-
Feedback
Search any
task
Search any
task