Dialogue on WildChat
Lexical Coverage: 47.3 (BACO best, Nov 7, 2025)
Updated 21h ago
Evaluation Results
| Method | Links | Date | Lexical Coverage | Lexical Dominance | Semantic Coverage | Semantic Dominance | Overall Coverage | Overall Dominance |
|---|---|---|---|---|---|---|---|---|
| BACO best | Representative Router=... | 2025.11 | 47.3 | 27.4 | 45.4 | 48.5 | 46.3 | 38 |
| Nudging | | 2025.11 | 43 | 11.4 | 38.7 | 15.6 | 40.8 | 13.5 |
| Aligned | | 2025.11 | 25.3 | 59.2 | 7.7 | 29.1 | 16.5 | 44.1 |
| Base | | 2025.11 | 0 | 1.9 | 0 | 6.8 | 0 | 4.38 |