Target behavior attack on MBPP
[Chart: ASR over time. Top result: AiTM, 99.2 ASR, Feb 20, 2025]
Evaluation Results
Method  Setting                    Date     ASR
AiTM    Framework=MetaGPT, Vic...  2025.02  99.2
AiTM    Framework=Camel, Commu...  2025.02  98.5
AiTM    Framework=AutoGen, Com...  2025.02  96.9
AiTM    Framework=MetaGPT, Vic...  2025.02  96.3
AiTM    Framework=Camel, Commu...  2025.02  95.9
AiTM    Framework=MetaGPT, Vic...  2025.02  95.1
AiTM    Framework=AutoGen, Com...  2025.02  92.4
AiTM    Framework=Camel, Commu...  2025.02  92.3
AiTM    Framework=AutoGen, Com...  2025.02  90.5
AiTM    Framework=MetaGPT, Vic...  2025.02  80.4
AiTM    Framework=AutoGen, Com...  2025.02  76.8
AiTM    Framework=Camel, Commu...  2025.02  73.1
AiTM    Framework=ChatDev, Vic...  2025.02  69.3
AiTM    Framework=ChatDev, Vic...  2025.02  55.9
AiTM    Framework=ChatDev, Vic...  2025.02  0
AiTM    Framework=ChatDev, Vic...  2025.02  0