Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OpenDialKG

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge-intensive DialogueOpendialKG
Factual Accuracy88.84
11
Knowledge-grounded dialogue generationOpenDialKG (test)
BLEU-420.77
10
Hallucination detectionOpenDialKG Eval (test)
Macro F176.2
7
ConversationOpenDialKG
Dist-22.9162
7
RecommendationOpenDialKG
Recall@128.95
7
RDF-to-text generationOpenDialKG (test)
Grammaticality98.5
6
Knowledge-grounded Dialogue GenerationOpenDialKG
Faithfulness81.67
4
Knowledge-Grounded Dialogue Generation (Fluency)OpenDialKG
Win Rate37.33
4
Showing 8 of 8 rows