Task-Focused Dialogue on MultiWOZ (G-Eval)

3.453 G-Eval Score

GPT-5.2

[Chart: G-Eval Score, axis ticks 3.349, 3.376, 3.403, 3.43; Jan 24, 2026]
Updated 4d ago

Evaluation Results

Method    Links
2026.01    3.453
2026.01    3.447
2026.01    3.407
2026.01    3.381
           3.368
2026.01    3.353