Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline-to-Online Reinforcement Learning on D4RL Aggregate
Loading...
71.95
Average Normalized Score
ROAD
53.3548
58.1824
63.01
67.8376
May 14, 2026
Average Normalized Score
Updated 19d ago
Evaluation Results
Method
Method
Links
Average Normalized Score
ROAD
Mixing Ratio Strategy=...
2026.05
71.95
0.1
Mixing Ratio Strategy=...
2026.05
62.15
BR
Mixing Ratio Strategy=...
2026.05
61.8
Decreasing
Mixing Ratio Strategy=...
2026.05
59
0.3
Mixing Ratio Strategy=...
2026.05
58.66
Uniform
Mixing Ratio Strategy=...
2026.05
58.45
0.2
Mixing Ratio Strategy=...
2026.05
57.28
0.4
Mixing Ratio Strategy=...
2026.05
56.49
0.5
Mixing Ratio Strategy=...
2026.05
56.26
0.0
Mixing Ratio Strategy=...
2026.05
54.07
Feedback
Search any
task
Search any
task