| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Sycophancy Assessment | BASIL 1.0 (Under-Update) | Change in Bayesian Error (RMSE)-0.355 | 32 | |
| Sycophancy Assessment | BASIL Over-Update 1.0 | Change in Bayesian Error (RMSE)0.016 | 32 | |
| Sycophancy Assessment | BASIL 1.0 (All) | Change in Bayesian Error (RMSE)-0.096 | 32 | |
| Bayesian Assessment of Sycophancy | BASIL User belief setting 1.0 (test) | Bayesian Error (RMSE)0.156 | 18 | |
| Bayesian Assessment of Sycophancy | BASIL Third-p. belief setting 1.0 (test) | Bayesian Error (RMSE)0.16 | 18 | |
| Bayesian Assessment of Sycophancy | BASIL Abstract setting 1.0 (test) | Bayesian Error (RMSE)0.197 | 18 |