Customer Service on TauBench
Best result: 91.8 accuracy (LLM-guided spec search)
[Chart: Accuracy and Delta (%) by date; latest point May 16, 2026]
Updated 15d ago
Evaluation Results
| Method | Student model | Date | Accuracy | Delta (%) |
|---|---|---|---|---|
| LLM-guided spec search | Qwen3.5-... | 2026.05 | 91.8 | 14.7 |
| Qwen3.5-9B | Qwen3.5-... | 2026.05 | 77.1 | - |
| LLM-guided spec search | Gemma4-E... | 2026.05 | 72.4 | 16.3 |
| LLM-guided spec search | Qwen3.5-... | 2026.05 | 56.7 | 12.3 |
| Gemma4-E4B | Gemma4-E... | 2026.05 | 56.1 | - |
| Qwen3.5-4B | Qwen3.5-... | 2026.05 | 44.4 | - |
| LLM-guided spec search | Nemotron... | 2026.05 | 25.6 | 14.5 |
| Nemotron-Nano-4B | Nemotron... | 2026.05 | 11.1 | - |
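The page does not define Delta (%). The reported values are consistent with an absolute difference in accuracy, in percentage points, between each LLM-guided spec search row and a baseline row. The sketch below checks that reading. The pairing of each spec-search row with a baseline is an assumption, because the "Student Model=" tags are truncated on the page, and the names `RESULTS`, `delta_points`, and `DELTAS` are hypothetical.

```python
# Accuracy pairs copied from the results: (baseline, LLM-guided spec search).
# ASSUMPTION: each spec-search row is paired with the baseline whose truncated
# "Student Model=" tag shares its prefix; the page does not confirm this pairing.
RESULTS = {
    "Qwen3.5-9B": (77.1, 91.8),
    "Gemma4-E4B": (56.1, 72.4),
    "Qwen3.5-4B": (44.4, 56.7),
    "Nemotron-Nano-4B": (11.1, 25.6),
}


def delta_points(baseline: float, improved: float) -> float:
    """Absolute accuracy difference in percentage points, rounded to 1 decimal."""
    return round(improved - baseline, 1)


# Delta per baseline model under the assumed pairing.
DELTAS = {name: delta_points(base, spec) for name, (base, spec) in RESULTS.items()}

if __name__ == "__main__":
    for name, delta in DELTAS.items():
        print(f"{name}: +{delta} points")
```

Under this pairing the computed values match the four Delta (%) entries reported above, which suggests the column is a percentage-point gain and not a relative improvement.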