
Reinforcement Learning on Ant (Zero-shot evaluation)

Zero-shot Reward: 2,506,511

Open-Ended Neural Reward Functions

[Chart: Zero-shot Reward, Feb 16, 2022]
Updated 1mo ago

Evaluation Results

Date       Zero-shot Reward    Method                                Links
2022.02    2,506,511           Open-Ended Neural Reward Functions    -