Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Walker-Rand-Params

Benchmarks

Task NameDataset NameSOTA ResultTrend
Offline Meta-Reinforcement LearningWalker-Rand-Params sampled 10 unseen (test)
Average Return344.2
10
Showing 1 of 1 rows