Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ExORL

Benchmarks

Task NameDataset NameSOTA ResultTrend
Zero-shot Reinforcement LearningExORL RND (Quadruped environment) v1 (test)
Jump Success758
12
Zero-shot Reinforcement LearningExORL RND Walker environment v1 (test)
Flip644
12
Offline Reinforcement LearningExORL
Cheetah Run Score104.7
9
Visual ControlExORL Jaco Zero-shot RND
Reach Top Left48
8
Visual ControlExORL Cheetah Zero-shot RND
Walk Score805
8
Zero-shot Reinforcement LearningExORL APS (Jaco environment) v1 (test)
Reach Bottom Left88
8
Zero-shot Reinforcement LearningExORL APS Cheetah environment v1 (test)
Run Backward383
8
Unsupervised Reinforcement LearningExORL jaco (4 tasks) zero-shot
Average Return23
6
Unsupervised Reinforcement LearningExORL quadruped zero-shot
Average Return645
6
Unsupervised Reinforcement LearningExORL cheetah (4 tasks) zero-shot
Average Return378
6
Unsupervised Reinforcement LearningExORL walker (4 tasks) zero-shot
Average Return619
6
Zero-shot Reinforcement LearningExORL APS Quadruped environment v1 (test)
Jump Score757
4
Zero-shot Reinforcement LearningExORL APS Walker environment v1 (test)
Flip Count573
4
Showing 13 of 13 rows