Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Adversarial Code Compliance on Overall Mean

97.1Decoupling Probability

Llama-3.1-8B

26.79645.04863.381.552Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
97.148.4
2026.01
95.851
2026.01
92.238
2026.01
8930.4
2026.01
83.642.9
2026.01
77.731.7
2026.01
71.628.3
2026.01
32.41.6
2026.01
29.50.6