Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Response Generation on CS Resp. (test)
Loading...
72.4
BS
Always-T4
61.376
64.238
67.1
69.962
Apr 26, 2026
BS
Updated 1mo ago
Evaluation Results
Method
Method
Links
BS
Always-T4
tier=T4
2026.04
72.4
Hybrid LLM
2026.04
70.2
RouteLLM
2026.04
69.8
ROUTENLP
2026.04
69.7
FrugalGPT
2026.04
69.5
AutoMix
2026.04
68.4
Always-T2
tier=T2
2026.04
61.8
Feedback
Search any
task
Search any
task