Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Gridworld (test)
Loading...
74.2
Usefulness
PPO DReST
63.072
65.961
68.85
71.739
Apr 19, 2026
Usefulness
Neutrality
Updated 1mo ago
Evaluation Results
Method
Method
Links
Usefulness
Neutrality
PPO DReST
RL Algorithm=PPO, Trai...
2026.04
74.2
74.7
A2C DReST
RL Algorithm=A2C, Trai...
2026.04
74.2
76.9
PPO Default
RL Algorithm=PPO, Trai...
2026.04
66.7
0
A2C Default
RL Algorithm=A2C, Trai...
2026.04
63.5
0
Feedback
Search any
task
Search any
task