Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Truthfulness on TruthfulQA DE
Loading...
17.4
Norm. Prob. Mass
Llama-Instruct
16.984
17.092
17.2
17.308
Mar 16, 2026
Norm. Prob. Mass
Updated 1mo ago
Evaluation Results
Method
Method
Links
Norm. Prob. Mass
Llama-Instruct
Mode=SFT, Shots=6-shot
2026.03
17.4
HATified-SFT
Mode=SFT, Shots=6-shot
2026.03
17
Feedback
Search any
task
Search any
task