Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Factual Question Answering on TruthfulQA (AUROC, 95% CI)
Loading...
59.9
AUROC
B1 entropy
42.012
46.656
51.3
55.944
Mar 25, 2026
AUROC
AUROC 95% CI
Updated 23d ago
Evaluation Results
Method
Method
Links
AUROC
AUROC 95% CI
B1 entropy
Cost=Free
2026.03
59.9
55.9
SelfCheck (k=5)
Cost=6×
2026.03
58.8
54.7
Semantic entropy (N=10)
Cost=11×
2026.03
54.8
51
P(True)
Cost=1 call
2026.03
42.7
40.8
Feedback
Search any
task
Search any
task