Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue on MT-Bench (full set)
Loading...
9.3
Accuracy (%)
Ref. SOTA
8.884
8.992
9.1
9.208
Mar 26, 2026
Accuracy (%)
Delta Score
p-value
Updated 22d ago
Evaluation Results
Method
Method
Links
Accuracy (%)
Delta Score
p-value
Ref. SOTA
Model Type=Proprietary
2026.03
9.3
-
-
EcoThink
2026.03
8.9
-0.4
0.145
Feedback
Search any
task
Search any
task