Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cooperative Cooking on Overcooked-AI Asymmetric Advantages
Loading...
381
J Score
Diagnostic-Grounded Search
225
265.5
306
346.5
Mar 25, 2026
J Score
Deliveries
Invalid Deliveries
Updated 24d ago
Evaluation Results
Method
Method
Links
J Score
Deliveries
Invalid Deliveries
Diagnostic-Grounded Search
Generation=Gen 2
2026.03
381
19.05
1.55
Diagnostic-Grounded Search
Generation=Gen 1
2026.03
322
16.1
3.45
MAPPO
Generation=Baseline
2026.03
231
11.55
6.15
Feedback
Search any
task
Search any
task