Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Offline Inverse Reinforcement Learning benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Offline Inverse Reinforcement Learning
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
D4RL HalfCheetah Medium v2
Offline ML-IRL
Cumulative Reward
9,313.29
8
4d ago
MuJoCo walker2d medium-exp
Expert Performance
Average Reward
5,383.98
5
4d ago
MuJoCo halfcheetah (medium-exp)
Expert Performance
Average Reward
12,174.61
5
4d ago
MuJoCo hopper (medium-exp)
Expert Performance
Average Reward
3,512.09
5
4d ago
MuJoCo walker2d (medium-replay)
Expert Performance
Avg Reward
5,383.98
5
4d ago
MuJoCo halfcheetah (medium-replay)
Expert Performance
Average Reward
12,174.61
5
4d ago
MuJoCo hopper (medium-replay)
Expert Performance
Average Reward
3,512.09
5
4d ago
MuJoCo walker2d medium
Expert Performance
Avg Reward
5,383.98
5
4d ago
MuJoCo halfcheetah (medium)
Expert Performance
Average Reward
12,174.61
5
4d ago
MuJoCo hopper medium
Expert Performance
Average Reward
3,512.09
5
4d ago
D4RL Walker2d Medium-Expert v2
Offline ML-IRL
Cumulative Reward
4,049.43
4
4d ago
D4RL HalfCheetah v2 (medium-expert)
Offline ML-IRL
Cumulative Reward
10,812.15
4
4d ago
D4RL Hopper Medium-Expert v2
Offline ML-IRL
Cumulative Reward
3,366.23
4
4d ago
D4RL Walker2d Medium-Replay v2
Offline ML-IRL
Cumulative Reward
4,100.99
4
4d ago
D4RL Hopper Medium-Replay v2
ValueDICE
Cumulative Reward
2,417.83
4
4d ago
D4RL Walker2d Medium v2
Offline ML-IRL
Cumulative Reward
4,121.68
4
4d ago
D4RL Hopper Medium v2
ValueDICE
Cumulative Reward
2,417.83
4
4d ago
Showing 17 of 17 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Terms of Service
FAQs