Instruction Following Evaluation on IFEval (dev)

Accuracy: 92

GPT-4

[Chart: Accuracy, y-axis ticks 74.008, 78.679, 83.35, 88.021; x-axis label Jan 24, 2025]
Updated 1mo ago

Evaluation Results

Method  Links  Date     Accuracy
               2025.01  92
               2025.01  79.62
               2025.01  74.7