Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Target Behavior Attack on MMLU phy
Loading...
99.3
ASR
AiTM
38.46
54.255
70.05
85.845
Feb 20, 2025
ASR
Updated 4d ago
Evaluation Results
Method
Method
Links
ASR
AiTM
Framework=Camel, Commu...
2025.02
99.3
AiTM
Framework=Camel, Commu...
2025.02
97.1
AiTM
Framework=AutoGen, Com...
2025.02
90.1
AiTM
Framework=AutoGen, Com...
2025.02
89.4
AiTM
Framework=AutoGen, Com...
2025.02
87.6
AiTM
Framework=Camel, Commu...
2025.02
85.7
AiTM
Framework=AutoGen, Com...
2025.02
79.5
AiTM
Framework=Camel, Commu...
2025.02
79.4
AiTM
Framework=Camel, Commu...
2025.02
77.4
AiTM
Framework=Camel, Commu...
2025.02
72.6
AiTM
Framework=AutoGen, Com...
2025.02
70.8
AiTM
Framework=Camel, Commu...
2025.02
61.2
AiTM
Framework=Camel, Commu...
2025.02
52.3
AiTM
Framework=AutoGen, Com...
2025.02
50.9
AiTM
Framework=AutoGen, Com...
2025.02
45.4
AiTM
Framework=AutoGen, Com...
2025.02
40.8
Feedback
Search any
task
Search any
task