Share your thoughts, 1 month free Claude Pro on usSee more

Open-ended Instruction Following on AlpacaEval GPT-5.2-judged (test)

64.4Win Rate

Hybrid KD

Updated 1mo ago

Evaluation Results

Method	Links
Hybrid KD 2026.05		64.4
Soft KD 2026.05		61.3
Hard KD 2026.05		57.5