Agentic Tool Use on τ²-Bench Telecom
Top result: Qwen3.5-27B, 99.3 accuracy
[Chart: Accuracy; Apr 9, 2026]
Updated 6d ago
Evaluation Results
Method               Configuration               Date     Accuracy
Qwen3.5-27B          Architecture=Dense, #...    2026.04  99.3
GPT-5 mini           Reasoning Mode=REASONI...   2026.04  74.1
K-EXAONE-236B-A23B   Architecture=MoE, # To...   2026.04  73.5
EXAONE 4.5 33B       Architecture=Dense, #...    2026.04  73
Qwen3-VL-235B-A22B   Architecture=MoE, # To...   2026.04  44.7