Share your thoughts, 1 month free Claude Pro on usSee more

Human Preference Evaluation on Basque Arena

1,183Arena Content Score

GPT-4o

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4o 2025.06		1,183	1,093	1,188
Claude 3.5 Sonnet 2025.06		1,150	1,082	1,153
70B + CEU IEN 2025.06		1,127	1,083	1,141
8B + CEU IEN+EU 2025.06		1,047	1,038	1,050
8B + CEU IEU 2025.06		1,045	1,034	1,050
8B + CEU IEN 2025.06		1,031	1,036	1,038
8B INSTRUCT EN 2025.06		766	783	722