Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Sycophancy benchmark

Benchmarks

Task NameDataset NameSOTA ResultTrend
Stealth Sycophancy DetectionThree-source Sycophancy Benchmark (test)
Spearman Correlation0.9567
17
Sycophancy DetectionSycophancy benchmark (full evaluation set)
AUROC0.732
12
Vision Language Model EvaluationSycophancy Benchmark
Mean Score88.8
6
Showing 3 of 3 rows