Meta-World+: An Improved, Standardized, RL Benchmark
About
Meta-World is widely used for evaluating multi-task and meta-reinforcement learning agents, which are challenged to master diverse skills simultaneously. Since its introduction however, there have been numerous undocumented changes which inhibit a fair comparison of algorithms. This work strives to disambiguate these results from the literature, while also leveraging the past versions of Meta-World to provide insights into multi-task and meta-reinforcement learning benchmark design. Through this process we release a new open-source version of Meta-World (https://github.com/Farama-Foundation/Metaworld/) that has full reproducibility of past results, is more technically ergonomic, and gives users more control over the tasks that are included in a task set.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-task reinforcement learning | Meta-World MT50 v2 | Overall Success Rate64.2 | 16 | |
| Multi-task reinforcement learning | Meta-World MT10 V2 | Success Rate86 | 15 | |
| Multi-task reinforcement learning | Meta-World MT50 V1 (final-checkpoint) | Success Rate (IQM)61.8 | 11 |