Coding on Terminal-Bench 1.1

43 Resolved Rate

Spell GPT-5.4

[Chart: axis ticks 34.68, 36.84, 39, 41.16; date May 7, 2026]
Updated 23d ago

Evaluation Results

Method    Links
2026.05   43   37.88    6,480,000
2026.05   43   83.86   94,650,000
2026.05   40   40.72    6,520,000
2026.05   40   27.36   10,210,000
2026.05   39   46.96   51,140,000
2026.05   38   39.43   39,930,000
2026.05   36   25.72    6,960,000
2026.05   35   12.26   10,110,000
2026.05   35   19.11   22,510,000