Share your thoughts, 1 month free Claude Pro on usSee more

Coding on Terminal-Bench 2.0

59.3Score

Claude Opus 4.5

Updated 4mo ago

Evaluation Results

Method	Links
Claude Opus 4.5 2026.02		59.3	-
Claude Opus 4.5 2026.02		57.9	-
GLM-5 2026.02		56.2	60.7
GLM-5 2026.02		56.2	61.1
Gemini 3 Pro 2026.02		54.2	-
GPT-5.2 (xhigh) 2026.02		54	-
Kimi K2.5 2026.02		50.8	-
DeepSeek-V3.2 2026.02		46.4	-
GLM-4.7 2026.02		41	-
DeepSeek-V3.2 2026.02		39.3	-
GLM-4.7 2026.02		32.8	-