Share your thoughts, 1 month free Claude Pro on usSee more

Natural Language Explanation Generation on DRUID (test)

0.102Faithfulness

CLUE-Span+Steering

Updated 2mo ago

Evaluation Results

Method	Links
CLUE-Span+Steering 2025.05		0.102	28	20	0.77
CLUE-Span+Steering 2025.05		0.099	15	70	0.69
CLUE-Span+Steering 2025.05		0.098	30	47	0.81
CLUE-Span 2025.05		0.089	20	38	0.78
CLUE-Span 2025.05		0.043	23	43	0.76
CLUE-Span 2025.05		0.014	8	79	0.65
PromptBaseline 2025.05		-0.08	-	-	0.6
PromptBaseline 2025.05		-0.12	-	-	0.57
PromptBaseline 2025.05		-0.13	-	-	0.53