Multi-turn dialogue routing on ShareGPT-LF Llama Series Set (cross-domain (legal and financial))
[Chart: top result MCTS Router, Success Rate (SR) 90.07, Apr 14, 2026. Metrics shown: Success Rate (SR), Average Turns (AT).]
Evaluation Results
| Method | Details | Date | Success Rate (SR) | Average Turns (AT) |
|---|---|---|---|---|
| MCTS Router | Candidate Set=Llama Se... | 2026.04 | 90.07 | 5.14 |
| Greedy Router | Candidate Set=Llama Se... | 2026.04 | 86.47 | 5.26 |
| DialRouter | Candidate Set=Llama Se... | 2026.04 | 83.03 | 5.48 |
| KNN Router | Candidate Set=Llama Se... | 2026.04 | 81.47 | 5.15 |
| Llama3.1-8B-Instruct | Candidate Set=Llama Se... | 2026.04 | 80.18 | 5.09 |
| Avengers | Candidate Set=Llama Se... | 2026.04 | 80.18 | 5.09 |
| Random | Candidate Set=Llama Se... | 2026.04 | 77.62 | 5.76 |
| Llama3.2-3B-Instruct | Candidate Set=Llama Se... | 2026.04 | 75.17 | 5.36 |
| Llama3.2-1B-Instruct | Candidate Set=Llama Se... | 2026.04 | 62.25 | 6.44 |