Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reinforcement Learning on AdroitHandDoor v1
Loading...
1,725
Average Return
Causal PBRS
-109.872
366.489
842.85
1,319.211
Sep 24, 2025
Oct 17, 2025
Nov 9, 2025
Dec 2, 2025
Dec 25, 2025
Jan 17, 2026
Feb 10, 2026
Average Return
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Return
Causal PBRS
State Dim Removed=1
2026.02
1,725
T-REX
State Dim Removed=1
2026.02
1,659
SAC Baseline
State Dim Removed=1
2026.02
1,472
Causal PBRS
State Dim Removed=1
2026.02
1,289
CQL
State Dim Removed=1
2026.02
415
CQL
State Dim Removed=1
2026.02
308
Trex
State Dim Removed=1
2026.02
105
Baseline
State Dim Removed=1
2026.02
71
Recurrent SAC
State Dim Removed=1
2026.02
-27
Recurrent SAC
State Dim Removed=1
2026.02
-27
Log-barrier DDPG
2025.09
-36
DDPG
2025.09
-39.3
Feedback
Search any
task
Search any
task