
Cleanup

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Multi-agent policy synthesis | Cleanup | U Score: 2.75 | 9
Multi-agent Social Dilemma Equality Evaluation | Cleanup | Equality Score (E): 95.9 | 9
Robot Plan Execution | Cleanup real-world | Success Rate (New Objects): 3 | 2