Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM agent tool-selection tasks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Tool Selection HijackingLLM agent tool-selection tasks
Attack Success Rate (ASR)69.8
9
Showing 1 of 1 rows