Social Norm Dialogue Generation on Prosocial-Dialog Korean
[Chart: Human Preference by date. Labeled point: V2R, 68 (Sep 22, 2025).]
Evaluation Results

Method               Details                    Date     Human Preference
V2R                  Evaluation condition=B...  2025.09  68
Untuned GPT-4o-mini  Evaluation condition=B...  2025.09  32