Share your thoughts, 1 month free Claude Pro on usSee more

SOTA Dialogue Policy Evaluation benchmarks and papers with code | Wizwand

Share your thoughts, 1 month free Claude Pro on usSee more

Dialogue Policy Evaluation

Benchmarks

Dataset Name	SOTA Method	Metric	Trend
PersonaChat (test)		USR RET97.7		10	4mo ago
Empathetic Dialogues (test)		USR MLM0.912		8	4mo ago
Dailydialog (test)		USR MLM81.1		8	4mo ago

Showing 3 of 3 rows