DRUID

Benchmarks

Task Name	Dataset Name	SOTA Result	Trend
Natural Language Explanation Generation	DRUID (test)	Faithfulness: 0.102		9
Human Evaluation of Explanations	DRUID	Helpfulness (MAR): 1.917		5
