Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Continuous Control on Swimmer v5
Loading...
119.3
Average Episodic Reward
TRFP(ours)
51.596
69.173
86.75
104.327
Mar 10, 2026
Mar 15, 2026
Mar 20, 2026
Mar 25, 2026
Mar 30, 2026
Apr 4, 2026
Apr 10, 2026
Average Episodic Reward
Updated 6d ago
Evaluation Results
Method
Method
Links
Average Episodic Reward
TRFP(ours)
NFE=4 × 4
2026.04
119.3
TRFP(one-step)
NFE=1 × 4
2026.04
118.9
PDA
Environment steps=1M,...
2026.03
111.3
SDAC
NFE=20 × 32
2026.04
85.8
MaxEntDP
NFE=20 × 10
2026.04
75.8
TD3
NFE=1
2026.04
58.9
SAC
NFE=1
2026.04
58.5
PPO
Environment steps=1M,...
2026.03
54.2
Feedback
Search any
task
Search any
task