Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Zero-Shot Coordination on Overcooked Unident_s environment (test)
Loading...
78.5
Sparse Reward
Surrogate Network
20.26
35.38
50.5
65.62
Apr 28, 2026
Sparse Reward
Relative Performance vs Base (ens.)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Sparse Reward
Relative Performance vs Base (ens.)
Surrogate Network
2026.04
78.5
119.2
LLM-Based
2026.04
72.5
102.5
Stratified Grid
2026.04
69.7
94.7
Random
2026.04
54.2
51.4
Baseline (ensembled)
Ensemble=true
2026.04
35.8
-
HSP
2026.04
24.9
-30.4
Baseline
Ensemble=false
2026.04
22.5
-37.2
Feedback
Search any
task
Search any
task