Goal Reaching on MiniGrid 8x8 Coordinates (test)
[Chart: Average Reward by date. PPO (No defense): 0.91, Jun 6, 2024.]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | Average Reward | Clean Reward | Max Reward (SA-RL) | Max Reward (PA-AD) | Reward (BIA-ILfD) |
|---|---|---|---|---|---|---|---|
| PPO (No defense) | Target Reward=0.96 ± 0 | 2024.06 | 0.91 | - | - | - | - |
| TDRT-PPO | Target Reward=0.96 ± 0 | 2024.06 | 0.59 | - | - | - | - |