Dialogue Generation on Norm-grounded Dialogue (Chinese) (test)
[Chart: Win Rate vs NormDial. Highest point: Qwen-2.5-32B, 79 (Sep 22, 2025). Metrics: Win Rate vs NormDial, Win Rate vs SODA.]
Updated 1mo ago
Evaluation Results
Method         Date      Win Rate vs NormDial   Win Rate vs SODA
Qwen-2.5-32B   2025.09   79                     -
GPT-4o-mini    2025.09   75                     -
Qwen-2.5-14B   2025.09   71                     -