Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Goal Reaching on MiniGrid 16x16 (Coordinates) (test)
Loading...
0.95
Average Performance
PPO (No defense)
0.4404
0.5727
0.705
0.8373
Jun 6, 2024
Average Performance
Clean Reward
Max Reward (SA-RL)
Max Reward (PA-AD)
BIA-ILfD (ours)
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Performance
Clean Reward
Max Reward (SA-RL)
Max Reward (PA-AD)
BIA-ILfD (ours)
PPO (No defense)
Target Reward=0.98 ± 0
2024.06
0.95
-
-
-
-
TDRT-PPO
Target Reward=0.98 ± 0
2024.06
0.46
-
-
-
-
Feedback
Search any
task
Search any
task