Machine Translation Evaluation on WMT 24
[Chart: Quality (cs-uk) by model; highlighted result 0.945 (Llama 4 Scout), data point dated Mar 10, 2026. Selectable metrics: Quality (cs-uk), Quality (en-cs), Quality (en-is).]
Evaluation Results
| Method | Shot | Date | Quality (cs-uk) | Quality (en-cs) | Quality (en-is) |
|---|---|---|---|---|---|
| Llama 4 Scout | Few | 2026.03 | 0.945 | 0.276 | 0.299 |
| Qwen 3 30B | Zero | 2026.03 | 0.937 | 0.286 | 0.269 |
| Llama 4 Scout | Zero | 2026.03 | 0.92 | 0.472 | 0.413 |
| Qwen 3 30B | Few | 2026.03 | 0.918 | 0.325 | 0.37 |
| Llama 3.3 70B | Few | 2026.03 | 0.854 | 0.41 | 0.419 |
| Llama 3.3 70B | Zero | 2026.03 | 0.794 | 0.491 | 0.513 |