Share your thoughts, 1 month free Claude Pro on usSee more

Human Evaluation of Explanations on DRUID

1.917Helpfulness (MAR)

CLUE-Span

Updated 2mo ago

Evaluation Results

Method	Links
CLUE-Span 2025.05		1.917	1.75	1.983	1.75	1.9
PromptBaseline 2025.05		1.9	1.717	1.983	1.767	1.9
CLUE-Span+Steering 2025.05		1.767	1.617	1.683	1.617	1.817
CLUE 2025.05		0.688	0.691	0.739	0.717	0.688
PromptBaseline 2025.05		0.312	0.309	0.261	0.283	0.313