Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Idiomatic Translation on Mixed Dataset en-te
Loading...
1.83
LLM-eval Score
IdiomCE
1.0396
1.2448
1.45
1.6552
May 28, 2025
LLM-eval Score
COMET
Updated 4d ago
Evaluation Results
Method
Method
Links
LLM-eval Score
COMET
IdiomCE
Model=GPT-4o
2025.05
1.83
0.66
Direct
Model=GPT-4o
2025.05
1.67
0.71
IdiomCE
Model=Gemma2-9b-it
2025.05
1.56
0.62
Direct
Model=Gemma2-9b-it
2025.05
1.46
0.67
IdiomCE
Model=LLama-3.1-8B
2025.05
1.3
0.54
Direct
Model=Indictrans2
2025.05
1.24
0.747
IdiomCE
Model=LLama-3.2-3B
2025.05
1.18
0.51
Direct
Model=LLama-3.1-8B
2025.05
1.12
0.59
Direct
Model=NLLB-200
2025.05
1.1
0.643
Direct
Model=LLama-3.2-3B
2025.05
1.07
0.52
Feedback
Search any
task
Search any
task