
Multi-agent Reinforcement Learning on Predator-Prey Orthogonal (PP-O)

41.4 Cumulative Reward

ML

Updated 2mo ago

Evaluation Results

Method	Links	Cumulative Reward
ML 2026.05		41.4
ML 2026.05		40.6
ST 2026.05		40.5
NS 2026.05		40
NS 2026.05		40
VL 2026.05		39.9
VL 2026.05		39.8
J-W 2026.05		39.7
J-M 2026.05		39.5
J-M 2026.05		39.4
ST 2026.05		39.4
MMR 2026.05		39.3
MMR 2026.05		39.3
ST 2026.05		39.2
CBTS 2026.05		39
NS 2026.05		38.9
J-M 2026.05		38.7
J-W 2026.05		38.7
CBTS 2026.05		38.7
J-W 2026.05		38.6
VL 2026.05		38.5
CBTS 2026.05		38.2
MMR 2026.05		37.9
ML 2026.05		36.6