Reinforcement Learning on D4RL Ant Medium
Figure: D4RL Score over time (May 22, 2024 to May 6, 2026). Top entry: Transformer, 94.25. Updated 27d ago.
Evaluation Results

Method                                      Details                     Date (YYYY.MM)   D4RL Score
Transformer                                 -                           2024.05          94.25
Aaren                                       -                           2024.05          93.29
Adaptive Policy Selection and Fine-Tuning   Phase=Online, Online I...   2026.05          82.3
Best                                        Phase=Offline, Online...    2026.05          69.7
OE                                          Phase=Offline, Online...    2026.05          69.7
OPE                                         Phase=Offline, Online...    2026.05          33.6
FT                                          Phase=Online, Online I...   2026.05          22.8