Offline Meta-Reinforcement Learning on Walker-Rand-Params sampled 10 unseen (test)

344.2 Average Return

CSRO

Updated 5mo ago

Evaluation Results

Method	Links	Average Return
CSRO 2023.11		344.2
CSRO 2023.11		319.7
CORRO 2023.11		312.5
OffPearl 2023.11		284.5
CORRO 2023.11		275.2
OffPearl 2023.11		262
BOREL 2023.11		260.6
FOCAL 2023.11		253.3
FOCAL 2023.11		247.5
BOREL 2023.11		245.8