
Multi-Agent Coordination on XOR game Generalization (Train n=2, k=3; Eval n=3, k=3)

0 Success Rate (Greedy Policy)

MAPPO

May 7, 2026
Updated 23d ago

Evaluation Results

Method  Links
2026.05  022
2026.05  022
2026.05  022
2026.05  022
2026.05  022
2026.05  022
2026.05  044
2026.05  00
2026.05  022
2026.05  022