Error Detection on TruthfulQA 200-question subset

0.512 AUROC

SE-NLI

Updated 4mo ago

Evaluation Results

Method	Links	AUROC
SE-NLI 2026.03		0.512	0.421	11
SE-NLI 2026.03		0.511	0.419	11
SE-NLI 2026.03		0.501	0.404	11