Robotic Control on Ant MDP
[Chart: Max Average Return over time. Top result: SAC, 5,614.1 (Sep 12, 2022)]
Updated 25d ago
Evaluation Results
Method                        Date     Max Average Return  t-test P-value (PPO vs Baseline)  t-test P-value (Original vs Baseline)
SAC                           2022.09  5,614.1             4                                 -
MSAC (History Length=5)       2022.09  5,000.62            4                                 4
TD3                           2022.09  4,773.7             4                                 -
LSTM-TD3 (History Length=5)   2022.09  4,070.03            4                                 4
MTD3 (History Length=5)       2022.09  3,236.93            4                                 4
PPO                           2022.09  476.99              -                                 -