Natural Language Explanation Generation on DRUID (test)

Top Faithfulness: 0.102 (CLUE-Span+Steering)

[Chart: Faithfulness, May 23, 2025]
Updated 1mo ago

Evaluation Results

Date      Faithfulness   Other values
2025.05    0.102         2820  0.77
2025.05    0.099         1570  0.69
2025.05    0.098         3047  0.81
2025.05    0.089         2038  0.78
2025.05    0.043         2343  0.76
2025.05    0.014         879   0.65
2025.05   -0.08          --    0.6
2025.05   -0.12          --    0.57
2025.05   -0.13          --    0.53