Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
User Simulation on ConvApparel Efficient Matchmaker Ultra-terse Fast V2 (held-out agent)
Loading...
25.4
Avg. Words per Turn
Static, Agnostic
11.152
14.851
18.55
22.249
May 12, 2026
Avg. Words per Turn
Total User Words
Negative Sentiment Rate
Iterative Refinement Rate
Updated 21d ago
Evaluation Results
Method
Method
Links
Avg. Words per Turn
Total User Words
Negative Sentiment Rate
Iterative Refinement Rate
Static, Agnostic
Zero-shot=true, Simula...
2026.05
25.4
92.1
26.8
8.8
Dynamic, Agnostic
Zero-shot=true, Simula...
2026.05
14.1
45.2
18.2
24.8
Static, Aware
Zero-shot=true, Simula...
2026.05
13.2
43.5
16.4
32.6
Dynamic, Aware
Zero-shot=true, Simula...
2026.05
11.8
40.1
12.2
40.9
Ground Truth Data
Zero-shot=true, Simula...
2026.05
11.7
39.7
11.9
41.3
Feedback
Search any
task
Search any
task