Share your thoughts, 1 month free Claude Pro on usSee more

Human Preference Evaluation for Code-switched Text Generation on EN-CS (Out of domain)

434.5Score

Gold Standard

Updated 4mo ago

Evaluation Results

Method	Links
Gold Standard 2025.02		434.5	1
Llama3 2025.02		282	2
NLLB 2025.02		247.5	3
Llama3 Instruct 2025.02		210	4
Llama3.3-70Bfs 2025.02		164	5
GPT-4ofs 2025.02		162	6