Our new X account is live! Follow @wizwand_team for updates
SOTA Offline Multi-Agent Reinforcement Learning benchmarks and papers with code | Wizwand
Offline Multi-Agent Reinforcement Learning
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| Multi-agent MuJoCo Hopper expert, medium, medium-replay, medium-expert | FACMAC+BC | Return | 3,621 | 12 | 4d ago |
| Warehouse Small (11x20) | AlberDICE | Mean Performance (N=2) | 5.97 | 6 | 4d ago |
| Warehouse Tiny (11x11) | AlberDICE | Mean Performance (N=2) | 11.15 | 6 | 4d ago |
| Bridge (Mix) | AlberDICE | Mean Return | -1.29 | 6 | 4d ago |
| Bridge Optimal | AlberDICE | Mean Return | -1.27 | 6 | 4d ago |
| Multi-agent MuJoCo Swimmer (e, m1, m2, e-m1, e-m2, m1-m2) | OMIGA | Return | 430.7 | 5 | 4d ago |
| Multi-agent MuJoCo HalfCheetah k=0 (e, m1, m2, e-m1, e-m2, m1-m2) | FACMAC+B3C | Return | 1,396.8 | 5 | 4d ago |
| Multi-agent MuJoCo HalfCheetah expert, medium, medium-replay, medium-expert | FACMAC+B3C | Return | 5,413.7 | 5 | 4d ago |
| Multi-agent MuJoCo Ant expert, medium, medium-replay, medium-expert | FACMAC+B3C | Return | 2,162.8 | 5 | 4d ago |
| SMAC corridor (medium-poor) | OMIGA | Average Score | 9.7 | 5 | 4d ago |
| SMAC corridor (good-medium) | OMIGA | Average Score | 14.02 | 5 | 4d ago |
| SMAC corridor (good-poor) | OMIGA | Average Score | 13.01 | 5 | 4d ago |
| SMAC 6h_vs_8z (medium-poor) | OMIGA | Average Score | 11.85 | 5 | 4d ago |
| SMAC 6h_vs_8z (good-medium) | OMIGA | Average Score | 12.05 | 5 | 4d ago |
| SMAC 6h_vs_8z (good-poor) | OMIGA | Average Score | 11.88 | 5 | 4d ago |
| Multi-agent MuJoCo HalfCheetah k=1 (e, m1, m2, e-m1, e-m2, m1-m2) | FACMAC+B3C | Return | 3,760.5 | 4 | 4d ago |