Factuality assessment on TruthfulQA using Personalization Bias (PB)
[Chart: Personalization Bias (PB). Identity-Robust Generation: 0.148 (Jan 14, 2026)]
Evaluation Results
Method                       Model          Date      Personalization Bias (PB)
Identity-Robust Generation   Llama3.3-70B   2026.01   0.148
Identity-Robust Generation   Qwen3          2026.01   0.241
Identity-Robust Generation   gpt-oss-20B    2026.01   0.346
Prompt Steering              Qwen3          2026.01   0.625
Prompt Steering              Llama3.3-70B   2026.01   0.729
Vanilla Generation           Qwen3          2026.01   0.864
Vanilla Generation           Llama3.3-70B   2026.01   0.894
Vanilla Generation           gpt-oss-20B    2026.01   1.504
Prompt Steering              gpt-oss-20B    2026.01   1.547