Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Pusher
Loading...
142
Average Returns
Multi-task
-1,135.432
-803.791
-472.15
-140.509
Sep 25, 2020
Aug 26, 2021
Jul 27, 2022
Jun 28, 2023
May 28, 2024
Apr 28, 2025
Mar 30, 2026
Average Returns
Updated 7d ago
Evaluation Results
Method
Method
Links
Average Returns
Multi-task
Target Network=MLP (2x...
2020.09
142
HyperCRL
Target Network=MLP (2x...
2020.09
99
Coreset
Target Network=MLP (2x...
2020.09
87
SI
Target Network=MLP (2x...
2020.09
40
DF-CWP-CP
Number of training see...
2026.03
39.88
A2C
Number of training see...
2026.03
32.41
CG-FPD
Number of training see...
2026.03
27.23
PPO
Number of training see...
2026.03
25.5
SAC
Number of training see...
2026.03
25.5
EWC
Target Network=MLP (2x...
2020.09
7
Finetuning
Target Network=MLP (2x...
2020.09
0
SMAC
batch size=1000, seeds=5
2026.01
-408.2
AC-SGD
batch size=1000, seeds=5
2026.01
-433.6
AC-CG
batch size=1000, seeds=5
2026.01
-441
AC-Adam
batch size=1000, seeds=5
2026.01
-568.3
AC-KFAC
batch size=1000, seeds=5
2026.01
-1,086.3
Feedback
Search any
task
Search any
task