Meta-World+: An Improved, Standardized, RL Benchmark

About

Meta-World is widely used for evaluating multi-task and meta-reinforcement learning agents, which are challenged to master diverse skills simultaneously. Since its introduction however, there have been numerous undocumented changes which inhibit a fair comparison of algorithms. This work strives to disambiguate these results from the literature, while also leveraging the past versions of Meta-World to provide insights into multi-task and meta-reinforcement learning benchmark design. Through this process we release a new open-source version of Meta-World (https://github.com/Farama-Foundation/Metaworld/) that has full reproducibility of past results, is more technically ergonomic, and gives users more control over the tasks that are included in a task set.

Reginald McLean, Evangelos Chatzaroulas, Luc McCutcheon, Frank R\"oder, Tianhe Yu, Zhanpeng He, K.R. Zentner, Ryan Julian, J K Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro• 2025

Related benchmarks

Task	Dataset	Result
Multi-task reinforcement learning	Meta-World MT50 v2	Overall Success Rate64.2	16
Multi-task reinforcement learning	Meta-World MT10 V2	Success Rate86	15
Multi-task reinforcement learning	Meta-World MT50 V1 (final-checkpoint)	Success Rate (IQM)61.8	11

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord