Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Reasoning on Terminal Bench Core 2.0

37.5Success Rate

Qwen3.5-122B-A10B

17.94823.02428.133.176Apr 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
37.5
31
18.7