Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Sycophancy Evaluation on TruthfulQA Adversarial n=50
Loading...
2
Sycophantic Responses Count
Gemini 2.5 Flash (Static Guardrails)
1.16
6.83
12.5
18.17
Apr 1, 2026
Sycophantic Responses Count
Sycophancy Rate
95% CI (Lower Bound) for Sycophancy Rate
Relative Reduction vs. Vanilla
Updated 17d ago
Evaluation Results
Method
Method
Links
Sycophantic Responses Count
Sycophancy Rate
95% CI (Lower Bound) for Sycophancy Rate
Relative Reduction vs. Vanilla
Gemini 2.5 Flash (Static Guardrails)
Model=Gemini 2.5 Flash...
2026.04
2
4
0.5
91.3
The Silicon Mirror
Model=Gemini 2.5 Flash...
2026.04
7
14
5.8
69.6
Gemini 2.5 Flash (Vanilla)
Model=Gemini 2.5 Flash...
2026.04
23
46
31.8
-
Feedback
Search any
task
Search any
task