Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Phishing Worm on Single-Agent Evaluation Set
Loading...
100
R@5
CEM Attack
54.24
66.12
78
89.88
Jan 11, 2026
R@5
SIM
Phishing
Worm
Updated 4d ago
Evaluation Results
Method
Method
Links
R@5
SIM
Phishing
Worm
CEM Attack
Model=GPT-4o
2026.01
100
77
-
-
fusion attack
Model=GPT-4o
2026.01
100
81
-
-
CEM Attack
Model=GPT-4o-mini
2026.01
100
77
-
-
fusion attack
Model=GPT-4o-mini
2026.01
100
81
-
-
Query+
Model=GPT-4o-mini
2026.01
63
70
-
-
Query+
Model=GPT-4o
2026.01
56
70
-
-
Feedback
Search any
task
Search any
task