Instruction Following Evaluation on SELF-INSTRUCT seed data

Best score: 72.01 (GPT-4 Turbo)

Updated 4mo ago

Evaluation Results

Method	Date	Score	Mean	Variance
GPT-4 Turbo	2024.09	72.01	-	-
GLM-4	2024.09	71.86	-	-
Claude3	2024.09	71.71	-	-
Qwen	2024.09	71.11	-	-
GPT-4	2024.09	70.06	-	-
Aggregate Statistics	2024.09	-	71.35	0.51
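
The Aggregate Statistics row can be checked against the five listed scores. The sketch below assumes that its two values are the arithmetic mean and the population variance of those scores; the page does not say so, but the arithmetic is consistent with that reading. The variable names are illustrative only.

```python
from statistics import fmean, pvariance

# Scores copied from the Evaluation Results table above.
scores = {
    "GPT-4 Turbo": 72.01,
    "GLM-4": 71.86,
    "Claude3": 71.71,
    "Qwen": 71.11,
    "GPT-4": 70.06,
}

values = list(scores.values())

# Assumption: the aggregate row reports the mean and the
# population variance (divide by n, not n - 1) of the scores.
mean_score = fmean(values)
var_score = pvariance(values)

print(f"mean     = {mean_score:.2f}")
print(f"variance = {var_score:.2f}")
```

Run as written, this prints a mean of 71.35 and a variance of 0.51, matching the aggregate row. The population standard deviation of the same scores is about 0.71, so the 0.51 figure is more plausibly a variance than a standard deviation.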