Instruction Following Evaluation on Ours hard seed data

Score: 56.73

GPT-4 Turbo

[Chart: y-axis ticks 47.1412, 49.6306, 52.12, 54.6094; x-axis tick Sep 27, 2024]
Updated 4d ago

Evaluation Results

Method    Links
2024.09   56.73   -       -
2024.09   53.75   -       -
2024.09   51.75   -       -
2024.09   49.85   -       -
2024.09   47.51   -       -
          -       51.92   10.06