Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Reasoning on Terminal Bench hard

26.8Success Rate

Qwen3.5-122B-A10B

23.88824.64425.426.156Apr 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
26.8
25.78
24