Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Truthfulness Evaluation on TruthfulQA (Adversarial Robustness)

100Accuracy under Attack

Reporting-and-penalty mechanism

68.876.98593.1Apr 26, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
100100
2026.04
7092