Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Sycophancy Detection benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Sycophancy Detection
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Sycophancy benchmark (full evaluation set)
Hypocrisy Gap
AUROC
0.732
12
3mo ago
LURE Truthfulness
Gemini 3 Flash
Sycophancy Rate
10
10
7d ago
SYCON-Bench raw n=100
Gemini 3 Flash
Sycophancy Rate
68
10
7d ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task