Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline-to-Online Reinforcement Learning on relocate cloned v1
Loading...
0.44
Average Online Expected Return
DUAL
-0.3088
-0.1144
0.08
0.2744
May 29, 2026
Average Online Expected Return
Updated 2d ago
Evaluation Results
Method
Method
Links
Average Online Expected Return
DUAL
Critic framework=IQL,...
2026.05
0.44
Diff-QL
Critic framework=IQL,...
2026.05
0.23
EDIS
Critic framework=IQL,...
2026.05
0.14
Base
Critic framework=IQL,...
2026.05
0.1
DUAL
Critic framework=Cal-Q...
2026.05
-0.12
EDIS
Critic framework=Cal-Q...
2026.05
-0.24
Diff-QL
Critic framework=Cal-Q...
2026.05
-0.26
Base
Critic framework=Cal-Q...
2026.05
-0.28
Feedback
Search any
task
Search any
task