
Task-Focused Dialogue on Mgshop (TSE, Reward, BLEU)

94.75 TSE

GOPO

Updated 4mo ago

Evaluation Results

Method	Links	TSE	Reward	BLEU
GOPO 2026.01		94.75	7.63	27.9
DeepSeek 2026.01		93.81	7.46	14.3
GLM 2026.01		93.65	7.25	15
GPT 2026.01		93.38	7.54	9.7
Gemini 2026.01		92.87	7.35	13.3
GOPO 2026.01		92.43	7.38	21.1
Qwen 2026.01		92.27	7.24	18.2
PPO 2026.01		85.84	7.09	19
Memento 2026.01		83.81	7.13	18.8
SFT 2026.01		83.63	6.25	18.7
Untrained 2026.01		74.54	5.97	9.1