Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deception Detection on Liars' Bench Harm-Pressure Choice (test)
Loading...
0.949
AUROC
Single (28)
0.897
0.9105
0.924
0.9375
Apr 15, 2026
AUROC
Improvement (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
AUROC
Improvement (%)
Single (28)
Layer=28
2026.04
0.949
-
5-Layer Ensemble
Stacking Method=logist...
2026.04
0.909
4.2
3-Layer Ensemble
Stacking Method=logist...
2026.04
0.899
-
Feedback
Search any
task
Search any
task