Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HumDial

Benchmarks

Task NameDataset NameSOTA ResultTrend
Empathetic Response GenerationHumDial Challenge Track 1 Task 3-en (dev)
LLM Score (0-5)4.36
6
Empathetic Response GenerationHumDial Challenge Track 1 Task 3-zh (dev)
LLM Score (0-5)4.53
6
Emotional ReasoningHumDial Challenge Track 1 Task 2-zh (dev)
LLM Score4.98
6
Full-duplex dialogueHumDial 1.5 (dev)
First Response Delay1.528
2
Full-duplex dialogueHumDial 1.5 (test)
Interruption Score89.7
1
Showing 5 of 5 rows