Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MPE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-agent Offline Reinforcement LearningMPE CN (Medium-replay)
Score95.4
16
Multi-agent Offline Reinforcement LearningMPE CN (Random)
Score88.3
16
Multi-Agent Reinforcement LearningMPE Adversary
Return19.1
11
Multi-Agent Reinforcement LearningMPE Simple Spread
Return-46
11
Multi-Agent Reinforcement LearningMPE Speaker-Listener
Return-46
11
Offline Multi-Agent Reinforcement LearningMPE World (Random)
Average Normalized Score94.3
8
Multi-Agent Reinforcement LearningMPE pz-mpe-simple-adversary
Params (K)124.36
5
Multi-Agent Reinforcement LearningMPE pz-mpe-simple-tag
Params (K)102.28
5
Multi-Agent Reinforcement LearningMPE pz-mpe-simple-spread
Number of learnable parameters (K)191.18
5
Simple SpreadMPE
Mean Episodic Reward-390.18
4
Multi-agent Reinforcement LearningMPE Predator-prey (PP) v1 (Expert)
Normalized Score118.2
4
Multi-agent Reinforcement LearningMPE Predator-prey (PP) v1 (Med-Rep)
Normalized Score71.1
4
Multi-agent Reinforcement LearningMPE Predator-prey (PP) v1 (Random)
Normalized Score78.5
4
Multi-agent Reinforcement LearningMPE Cooperative Navigation (CN) v1 (Expert)
Normalized Score114.9
4
Multi-agent Multi-objective Reinforcement LearningMPE
Hypervolume11,080.9224
3
Showing 15 of 15 rows