Instruction-following clustering on REASONCLUSTER (Overall)

68.42 V-measure (%)

C1-Qwen-14B

Updated 4mo ago

Evaluation Results

Method	Date	V-measure (%)
C1-Qwen-14B	2026.03	68.42
C1-Qwen-7B	2026.03	66.54
o3	2026.03	65.08
Gemini 2.5 Pro	2026.03	61.82
QwQ-32B	2026.03	54.78
GPT-oss-120B	2026.03	52.31
Distill-Llama-70B	2026.03	45.12
DeepSeek-R1	2026.03	44.06
GPT-4.1	2026.03	43.51
GPT-4o	2026.03	41.26
Distill-Qwen-32B	2026.03	34.96
Llama-3.1-70B-Instruct	2026.03	31.55
Qwen2.5-72B-Instruct	2026.03	29.06
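
The scores above are V-measure percentages. As a rough illustration of what that number measures, here is a minimal sketch of the standard V-measure (harmonic mean of homogeneity and completeness, with equal weighting). The function names and toy labels are hypothetical, and the page does not describe how REASONCLUSTER actually computes its scores, so treat this as the textbook definition rather than the benchmark's evaluation code.

```python
from collections import Counter
from math import log


def _entropy(labels):
    """Shannon entropy H(labels) in nats."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def _conditional_entropy(a, b):
    """Conditional entropy H(a | b) in nats."""
    n = len(a)
    joint = Counter(zip(a, b))
    b_counts = Counter(b)
    return -sum((c / n) * log(c / b_counts[bv]) for (_, bv), c in joint.items())


def v_measure(true_labels, pred_labels):
    """Standard V-measure in [0, 1]; assumes equal weighting (beta = 1)."""
    if len(true_labels) != len(pred_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")

    h_true = _entropy(true_labels)
    h_pred = _entropy(pred_labels)

    # Homogeneity: each predicted cluster contains only one true class.
    homogeneity = (
        1.0 if h_true == 0
        else 1.0 - _conditional_entropy(true_labels, pred_labels) / h_true
    )
    # Completeness: each true class ends up in a single predicted cluster.
    completeness = (
        1.0 if h_pred == 0
        else 1.0 - _conditional_entropy(pred_labels, true_labels) / h_pred
    )

    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Toy example (hypothetical labels): over-splitting two classes into four clusters.
score = v_measure([0, 0, 1, 1], [0, 1, 2, 3])
print(f"{100 * score:.2f}")  # reported as a percentage, like the table above
```

Because the measure depends only on how items are grouped, relabeling clusters (for example swapping cluster ids 0 and 1) leaves the score unchanged.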