Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Automated Probing on TruthfulQA
Loading...
40
Error Rate (%)
PAIR
38.48
48.74
59
69.26
Feb 13, 2026
Error Rate (%)
Attack Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Error Rate (%)
Attack Success Rate
PAIR
Generator Model=GPT-5....
2026.02
40
93.86
AutoDetect
Generator Model=GPT-5....
2026.02
56
-
PROBELLM
Generator Model=GPT-5....
2026.02
78
-
Feedback
Search any
task
Search any
task