SOTA Offline Multi-Agent Reinforcement Learning benchmarks and papers with code | Wizwand
Offline Multi-Agent Reinforcement Learning
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| Multi-agent MuJoCo Hopper expert, medium, medium-replay, medium-expert | FACMAC+BC | Return | 3,621 | 12 | 3mo ago |
| MPE World (Random) | SPaCQL | Average Normalized Score | 94.3 | 8 | 2mo ago |
| MPE World (Medium) | PLCQL | Average Normalized Score | 104.9 | 8 | 2mo ago |
| SMAC Expert Marine-Hard | HiSSD | Performance at 3m | 99.4 | 8 | 2mo ago |
| MaMuJoCo Half-C (Random) | SPaCQL | Average Normalized Score | 43.8 | 7 | 2mo ago |
| MaMuJoCo Half-C (Medium-Replay) | PLCQL | Average Normalized Score | 73.1 | 7 | 2mo ago |
| MaMuJoCo Half-C Medium | IQL | Average Normalized Score | 81.3 | 7 | 2mo ago |
| MaMuJoCo Half-C Expert | CFCQL | Average Normalized Score | 118.5 | 7 | 2mo ago |
| MaMuJoCo 2-HalfCheetah (Random) | CFCQL | Average Return | 39.7 | 6 | 5d ago |
| MaMuJoCo 2-HalfCheetah (Med-Replay) | OMSD | Average Return | 78.9 | 6 | 5d ago |
| MaMuJoCo 2-HalfCheetah (Expert) | OMSD | Average Return | 119 | 6 | 5d ago |
| Warehouse Small (11x20) | AlberDICE | Mean Performance (N=2) | 5.97 | 6 | 3mo ago |
| Warehouse Tiny (11x11) | AlberDICE | Mean Performance (N=2) | 11.15 | 6 | 3mo ago |
| Bridge (Mix) | AlberDICE | Mean Return | -1.29 | 6 | 3mo ago |
| Bridge Optimal | AlberDICE | Mean Return | -1.27 | 6 | 3mo ago |
| SMAC | DLM-GRPO | 3s5z Win Rate | 97 | 5 | 1mo ago |
| Multi-agent MuJoCo Swimmer (e, m1, m2, e-m1, e-m2, m1-m2) | OMIGA | Return | 430.7 | 5 | 3mo ago |
| Multi-agent MuJoCo HalfCheetah k=0 (e, m1, m2, e-m1, e-m2, m1-m2) | FACMAC+B3C | Return | 1,396.8 | 5 | 3mo ago |
| Multi-agent MuJoCo HalfCheetah expert, medium, medium-replay, medium-expert | FACMAC+B3C | Return | 5,413.7 | 5 | 3mo ago |
| Multi-agent MuJoCo Ant expert, medium, medium-replay, medium-expert | FACMAC+B3C | Return | 2,162.8 | 5 | 3mo ago |
| SMAC corridor (medium-poor) | OMIGA | Average Score | 9.7 | 5 | 3mo ago |
| SMAC corridor (good-medium) | OMIGA | Average Score | 14.02 | 5 | 3mo ago |
| SMAC corridor (good-poor) | OMIGA | Average Score | 13.01 | 5 | 3mo ago |
| SMAC 6h_vs_8z (medium-poor) | OMIGA | Average Score | 11.85 | 5 | 3mo ago |
| SMAC 6h_vs_8z (good-medium) | OMIGA | Average Score | 12.05 | 5 | 3mo ago |
Showing 25 of 33 rows