Share your thoughts, 1 month free Claude Pro on usSee more

Reinforcement Learning on D4RL Ant Medium

94.25D4RL Score

Transformer

Updated 2mo ago

Evaluation Results

Method	Links
Transformer 2024.05		94.25
Aaren 2024.05		93.29
Adaptive Policy Selection and Fine-Tuning 2026.05		82.3
Best 2026.05		69.7
OE 2026.05		69.7
OPE 2026.05		33.6
FT 2026.05		22.8