Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Therapeutic Dialogue Generation on MESC (test)
Loading...
0.0111
BLEU
Reinforcement Learning - SFT
0.00174
0.00417
0.0066
0.00903
Nov 14, 2025
BLEU
ROUGE-1
ROUGE-2
ROUGE-L
METEOR
Updated 3mo ago
Evaluation Results
Method
Method
Links
BLEU
ROUGE-1
ROUGE-2
ROUGE-L
METEOR
Reinforcement Learning - SFT
Training=Reinforcement...
2025.11
0.0111
0.1397
0.0213
0.1317
0.0581
SFT (Therapist emotions)
Emotion Context=Inclus...
2025.11
0.0108
0.1164
0.0175
0.1076
0.0485
SFT (No therapist emotions)
Emotion Context=None,...
2025.11
0.0106
0.116
0.0159
0.1058
0.0433
GPT-2
Description=Baseline m...
2025.11
0.0021
0.0478
0.0064
0.0408
0.0691
Feedback
Search any
task
Search any
task