Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DRUID

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language Explanation GenerationDRUID (test)
Faithfulness0.102
9
Human Evaluation of ExplanationsDRUID
Helpfulness (MAR)1.917
5
Showing 2 of 2 rows