Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following on IF-Eval

84.66Accuracy

General Teacher

Updated 1mo ago

Evaluation Results

Method	Links
General Teacher 2026.05		84.66
Base 2025.09		63.7
CaMOPD 2026.05		59.89
IPO 2025.09		56.2
RealSafe 2025.09		54.7
SelecTKD 2026.05		49.17
Medical Teacher 2026.05		48.98
Relaxed OPD 2026.05		48.61
Vanilla MOPD 2026.05		48.06
Base Model 2026.02		40.48
Numerical 2026.02		39.93
Random 2026.02		39.74
WIM Fixed Judge 2026.02		39.56
WIM Changing Judge 2026.02		39.19