Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Agents Failure Attribution

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-agent recommendationAgents Failure Attribution
Top-1 Accuracy100
4
Single-agent tool selectionAgents Failure Attribution
Top-1 Accuracy95.3
4
Showing 2 of 2 rows