| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| PHIL | Supervised Pinpoint Tuning | Sycophancy Preference | 99.34 | 10 | 4d ago |
| POLI | Ours Resid | Sycophantic Preference (%) | 92.18 | 10 | 4d ago |
| NLP | Synthetic Data Intervention | Sycophancy Preference | 49.25 | 10 | 4d ago |
| Open-Ended Sycophancy | Synthetic Data Intervention | Syc Score | 48.15 | 10 | 4d ago |
| Syco-Bench | | Pickside Score | 1.21 | 10 | 4d ago |
| Offline Evaluation Set | gpt-5-thinking | Sycophancy Prevalence Score | 4 | 3 | 4d ago |
| Early A/B tests Online prevalence | gpt-5-main | Prevalence Change (Free Users) | -0.69 | 1 | 4d ago |