Multi-Task Reinforcement Learning on Meta-World MT50 V1 (final-checkpoint)

Top result: TOPPO, 79.3 Success Rate (IQM)

Updated 2mo ago

Evaluation Results

Method	Date	Links	Success Rate (IQM)
TOPPO	2026.05		79.3
MT-PPO + LN-c + PopArt + FG-c	2026.05		79
MT-PPO + LN-c + PopArt	2026.05		76.5
MT-PPO + LN-c	2026.05		64.3
MT-PPO + LN-c + FG-c	2026.05		62
MOORE	2026.05		61.8
Soft Modularization	2026.05		60.6
PCGrad-SAC	2026.05		45.8
Vanilla MT-PPO	2026.05		45.2
MT-MH-SAC	2026.05		31.9
PaCo	2026.05		18.6