Goal-conditioned navigation on MiniGrid 16x16 (RGB image input) (test)
[Chart: Clean Reward over time. Plotted point: TDRT-PPO, 0.44, Jun 6, 2024. Selectable metrics: Clean Reward; Attack Reward: Rew Max (SA-RL); Attack Reward: Rew Max (PA-AD); Attack Reward: BIA-ILfD (ours); Attack Reward: Avg]
Updated 4d ago
Evaluation Results
| Method | Links | Clean Reward | Attack Reward: Rew Max (SA-RL) | Attack Reward: Rew Max (PA-AD) | Attack Reward: BIA-ILfD (ours) | Attack Reward: Avg |
| --- | --- | --- | --- | --- | --- | --- |
| TDRT-PPO (Target Reward=0.98 ± 0) | 2024.06 | 0.44 | 0.47 | 0.63 | 0.39 | 0.5 |
| PPO (Target Reward=0.98 ± 0...) | 2024.06 | 0.38 | 0.49 | 0.94 | 0.48 | 0.64 |
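
The "Attack Reward: Avg" values appear to be the arithmetic mean of the three per-attack rewards, rounded to two decimals. The page does not state this; it is an inference from the listed numbers. A minimal Python sketch of that check follows, with the rewards copied from the results above and a hypothetical helper name `mean_attack_reward`:

```python
# Per-attack rewards copied from the results above, in the order:
# Rew Max (SA-RL), Rew Max (PA-AD), BIA-ILfD (ours).
attack_rewards = {
    "TDRT-PPO": [0.47, 0.63, 0.39],
    "PPO": [0.49, 0.94, 0.48],
}

# "Attack Reward: Avg" values as listed on the page.
listed_avg = {"TDRT-PPO": 0.5, "PPO": 0.64}


def mean_attack_reward(values):
    """Arithmetic mean of the per-attack rewards (assumed definition of Avg)."""
    return sum(values) / len(values)


for method, rewards in attack_rewards.items():
    computed = round(mean_attack_reward(rewards), 2)
    # Compare with a small tolerance to avoid floating-point surprises.
    matches = abs(computed - listed_avg[method]) < 1e-9
    print(f"{method}: computed {computed}, listed {listed_avg[method]}, match={matches}")
```

Under this assumption both rows reproduce the listed averages, which suggests the Avg column summarizes only the three attack settings and excludes Clean Reward.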