Sycophancy Mitigation on TruthfulQA
[Chart: Sycophancy Rate by date. Highlighted entry: Linear Probe MHA, 25 (Jan 23, 2026).]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Sycophancy Rate |
| --- | --- | --- | --- |
| Linear Probe MHA | Model=Llama 3.2, Inter... | 2026.01 | 25 |
| Linear Probe MHA | Model=Gemma-3, Interve... | 2026.01 | 34.4 |
| System Prompt | Model=Llama 3.2 | 2026.01 | 37.5 |
| Base | Model=Gemma-3 | 2026.01 | 40.7 |
| System Prompt | Model=Gemma-3 | 2026.01 | 40.7 |
| Linear Probe Residual | Model=Gemma-3 | 2026.01 | 41.2 |
| Random Direction MLP | Model=Gemma-3 | 2026.01 | 42.6 |
| Random Direction MHA | Model=Llama 3.2 | 2026.01 | 42.7 |
| Linear Probe MLP | Model=Gemma-3 | 2026.01 | 43.9 |
| Linear Probe Residual | Model=Llama 3.2 | 2026.01 | 44.2 |
| Linear Probe MLP | Model=Llama 3.2 | 2026.01 | 44.4 |
| Random Direction MLP | Model=Llama 3.2 | 2026.01 | 44.7 |
| Random Direction MHA | Model=Gemma-3 | 2026.01 | 45.1 |
| Base | Model=Llama 3.2 | 2026.01 | 51.7 |