Cooperative Multi-Agent Cooking on Farm-to-Table Cooking Task 99 1.0
Top result: 85.26 C (%), VillagerAgent, Jun 9, 2024
Metrics: C (%), ACR, E (%/min), B (%)
Updated 4d ago
Evaluation Results
| Method        | Base LLM         | Date    | C (%) | ACR   | E (%/min) | B (%) |
|---------------|------------------|---------|-------|-------|-----------|-------|
| VillagerAgent | GPT-4-1106-pr... | 2024.06 | 85.26 | 55.6  | 21.9      | 84.38 |
| VillagerAgent | GPT-4-1106-pr... | 2024.06 | 73.75 | 58.11 | 6.98      | 96.13 |
| VillagerAgent | GLM-4            | 2024.06 | 46.84 | 54.07 | 4.79      | 75.46 |
| AgentVerse    | GPT-4-1106-pr... | 2024.06 | 29.75 | 48.64 | 3.54      | 87.13 |
| VillagerAgent | Gemini-Pro       | 2024.06 | 26.05 | 32.92 | 3.35      | 83.15 |