
Multi-agent system task solving on workbench

Accuracy: 82.4

TacoMAS

26.86441.28255.770.118 May 10, 2026
Updated 22d ago

Evaluation Results

Date     Accuracy
2026.05  82.4
2026.05  65.1
2026.05  64.5
2026.05  59.5
2026.05  53.8
2026.05  52.3
2026.05  51.1
2026.05  49.2
2026.05  47.8
2026.05  47.3
2026.05  46.9
2026.05  44.6
2026.05  44.1
2026.05  44.1
2026.05  44
2026.05  41.9
2026.05  41.6
2026.05  38.6
2026.05  38.6
2026.05  34.7
2026.05  29