Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

relocate

Benchmarks

Task NameDataset NameSOTA ResultTrend
Offline-to-online Reinforcement Learningrelocate
Regret62.8
12
Offline-to-Online Reinforcement Learningrelocate cloned v1
Average Online Expected Return0.44
8
Showing 2 of 2 rows