Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-party Dialogue Generation on Multi-party dialogue (test)
Loading...
0.82
Coherence
Ground Truth
0.5704
0.6352
0.7
0.7648
Apr 8, 2026
Coherence
Fluency
Informativeness
Helpfulness
Overall Score
Updated 9d ago
Evaluation Results
Method
Method
Links
Coherence
Fluency
Informativeness
Helpfulness
Overall Score
Ground Truth
2026.04
0.82
0.89
0.93
0.87
3.51
Qwen3-8B+DRCR
Backbone=Qwen3-8B, Met...
2026.04
0.79
0.94
0.86
0.82
3.41
SS-MPC
2026.04
0.73
0.91
0.82
0.79
3.25
Qwen3-8B+SFT
Backbone=Qwen3-8B, Fin...
2026.04
0.71
0.9
0.82
0.76
3.19
RL-TRC
2026.04
0.69
0.9
0.81
0.72
3.12
MADNet
2026.04
0.58
0.87
0.76
0.63
2.84
Feedback
Search any
task
Search any
task