Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Humanoid v5
Loading...
5,906.7
Performance Score
SAC+DBC(*)
3,344.036
4,009.343
4,674.65
5,339.957
Feb 5, 2026
Performance Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Performance Score
SAC+DBC(*)
Algorithm=SAC, Critic=DBC
2026.02
5,906.7
QVPO+DBC(*)
Algorithm=QVPO, Critic...
2026.02
5,426.3
TD3+DBC(*)
Algorithm=TD3, Critic=DBC
2026.02
5,343.2
SAC+TQC
Algorithm=SAC, Critic=TQC
2026.02
5,269
SAC+CDQ
Algorithm=SAC, Critic=CDQ
2026.02
5,207.1
QVPO+CDQ
Algorithm=QVPO, Critic...
2026.02
5,068.3
TD3+CDQ
Algorithm=TD3, Critic=CDQ
2026.02
5,067.8
SAC+VF
Algorithm=SAC, Critic=VF
2026.02
4,950.3
SAC+VD
Algorithm=SAC, Critic=VD
2026.02
4,886.9
SAC+IQN
Algorithm=SAC, Critic=IQN
2026.02
4,729.2
SAC+DSAC
Algorithm=SAC, Critic=...
2026.02
3,442.6
Feedback
Search any
task
Search any
task