Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
User Simulation on ConvApparel Domain Expert Academic Verbose V2 (held-out agent)
Loading...
12.1
Average Words per Turn
Dynamic, Aware
8.668
9.559
10.45
11.341
May 12, 2026
Average Words per Turn
Total User Words
Negative Sentiment (%)
Iterative Refinement (%)
Updated 21d ago
Evaluation Results
Method
Method
Links
Average Words per Turn
Total User Words
Negative Sentiment (%)
Iterative Refinement (%)
Dynamic, Aware
Zero-shot=true, Simula...
2026.05
12.1
38.5
13.6
31.2
Ground Truth Data
Zero-shot=true, Simula...
2026.05
12
38.7
13.4
31.7
Dynamic, Agnostic
Zero-shot=true, Simula...
2026.05
11.2
35.1
14.5
22.3
Static, Aware
Zero-shot=true, Simula...
2026.05
10.5
32
15.1
25
Static, Agnostic
Zero-shot=true, Simula...
2026.05
8.8
28
18
12.5
Feedback
Search any
task
Search any
task