Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DeceptionBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Deception DetectionDeceptionBench (full 600-sample)
Response AUROC97.4
15
Deception DetectionDeceptionBench
Response AUROC93.5
6
Showing 2 of 2 rows