
Utterance-level User Simulation on Chinese User Simulation Dataset

69.92 AI Probability

UserLM

Apr 15, 2026
Updated 3d ago

Evaluation Results

Method    Links
2026.04    69.92  58.88  55.38  51.11  52.89  47.42  56.55
2026.04    45.5  71.81  62.79  88.91  92.21  83.04  92.14
2026.04    44.45  73.07  64.69  86.44  91.02  78.88  89.76
2026.04    43.14  66.44  59.25  63.84  68.74  57.79  73.14
2026.04    37.98  76.06  64.8  90.46  95.91  85.3  93.89
2026.04    31.18  75.34  64.89  91.96  97.76  87.63  96.2