Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Idiomatic Translation on Mixed Dataset en-hi
Loading...
2.39
LLM-eval Score
IdiomCE
1.0692
1.4121
1.755
2.0979
May 28, 2025
LLM-eval Score
COMET Score
Updated 4d ago
Evaluation Results
Method
Method
Links
LLM-eval Score
COMET Score
IdiomCE
Model=GPT-4o
2025.05
2.39
0.7
Direct
Model=GPT-4o
2025.05
2.14
0.73
IdiomCE
Model=Gemma2-9b-it
2025.05
1.88
0.68
IdiomCE
Model=LLama-3.1-8B
2025.05
1.655
0.63
Direct
Model=Gemma2-9b-it
2025.05
1.6
0.73
IdiomCE
Model=LLama-3.2-3B
2025.05
1.34
0.59
Direct
Model=NLLB-200
2025.05
1.3
0.7
Direct
Model=LLama-3.1-8B
2025.05
1.27
0.68
Direct
Model=Indictrans2
2025.05
1.247
0.74
Direct
Model=LLama-3.2-3B
2025.05
1.12
0.62
Feedback
Search any
task
Search any
task