Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Red-teaming on AI Supremacy principle v1 (test)
Loading...
11.7
Mean Best Category Score
CRL
1.0504
3.8152
6.58
9.3448
Feb 12, 2026
Mean Best Category Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Best Category Score
CRL
Target Model=Llama-3.1...
2026.02
11.7
CRL
Target Model=Gemma3-12...
2026.02
11.4
QCI
Target Model=Llama-3.1...
2026.02
10.9
CRL
Target Model=GPT-4.1-Mini
2026.02
10.9
QCI
Target Model=Qwen3-30B...
2026.02
10.3
CRL
Target Model=Qwen3-30B...
2026.02
10.2
QCI
Target Model=GPT-4.1-Mini
2026.02
10.1
QCI
Target Model=Gemma3-12...
2026.02
9.88
RS
Target Model=Llama-3.1...
2026.02
2.87
RS
Target Model=Gemma3-12...
2026.02
1.88
RS
Target Model=Qwen3-30B...
2026.02
1.52
RS
Target Model=GPT-4.1-Mini
2026.02
1.46
Feedback
Search any
task
Search any
task