Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WoW

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge-grounded DialogueWoW
F1 Score17.35
15
Retrieval-Augmented GenerationWoW
LLM Score88.87
11
DialogueWoW
F1 Score14.77
8
Knowledge-grounded Dialog GenerationWoW (Seen)
Appropriateness Score4.5
6
Natural Language GenerationWoW
Mean Relevance4.68
5
Knowledge-grounded DialogueWoW (test)
Dialogue Turns163
2
Showing 6 of 6 rows