Deception Detection on Liars' Bench Convincing Game (test)
[Figure: AUROC over time. Series: Single (28). Date shown: Apr 15, 2026.]
Updated 1mo ago
Evaluation Results
Method            Details                    Date     AUROC  Percentage Improvement
Single (28)       Layer=28                   2026.04  1      -
3-Layer Ensemble  Stacking Method=logist...  2026.04  1      -
5-Layer Ensemble  Stacking Method=logist...  2026.04  1      0