Code Execution on Multi-Agent Evaluation Set

100 R@5

Query+

Updated 5mo ago

Evaluation Results

Method	Links
Query+ 2026.01		100	0.76	-
CEM Attack 2026.01		100	0.78	-
fusion attack 2026.01		100	0.85	-
Query+ 2026.01		100	0.75	-
CEM Attack 2026.01		100	0.78	-
fusion attack 2026.01		100	0.83	-