Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Dishonesty Evaluation benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Dishonesty Evaluation
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Mistake math (test)
PRISM
Benchmark Dishonesty
44.16
96
13d ago
Insecure code (test)
PRISM
Benchmark Dishonesty
48.91
32
13d ago
Mistake medical (test)
CONCEPTINF
Dishonesty Accuracy
67.52
32
13d ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task