Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Violation Scenario Generation on Scenario S1
Loading...
4.9
Mean Score
LawBreaker
4.7016
6.0408
7.38
8.7192
Feb 5, 2026
Mean Score
Max Score
High Violation Rate
Rate > 6 Violations
Rate > 8 Violations
Rate > 10 Violations
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Score
Max Score
High Violation Rate
Rate > 6 Violations
Rate > 8 Violations
Rate > 10 Violations
LawBreaker
2026.02
4.9
11
40
14
6
-
ABLE
2026.02
9.12
20
64
49
38
-
ROMAN
2026.02
9.86
22
75
61
42
-
Feedback
Search any
task
Search any
task