Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Factuality and Efficiency Evaluation on TruthfulQA
Loading...
55.6
Factuality Score
Full
33.552
39.276
45
50.724
Mar 5, 2026
Factuality Score
Inference Speedup
Updated 1mo ago
Evaluation Results
Method
Method
Links
Factuality Score
Inference Speedup
Full
Model=Dream-7B, Infere...
2026.03
55.6
-
LSP
Model=Dream-7B, Infere...
2026.03
53.5
1.86
LSP
Model=LLaDA-8B, Infere...
2026.03
45.8
2.29
Full
Model=LLaDA-8B, Infere...
2026.03
34.4
-
Feedback
Search any
task
Search any
task