Multi-Task Reinforcement Learning on Meta-World MT50 V1 (final-checkpoint)
[Chart: Success Rate (IQM) by date. Highlighted point: TOPPO, 79.3, May 12, 2026.]
Updated 21d ago
Evaluation Results
Method                         Details                    Date     Success Rate (IQM)
TOPPO                          Params=717K, Backbone=...  2026.05  79.3
MT-PPO + LN-c + PopArt + FG-c  Params=717K, Backbone=...  2026.05  79
MT-PPO + LN-c + PopArt         Params=717K, Backbone=...  2026.05  76.5
MT-PPO + LN-c                  Params=717K, Backbone=...  2026.05  64.3
MT-PPO + LN-c + FG-c           Params=717K, Backbone=...  2026.05  62
MOORE                          Params=10385K, TOPPO s...  2026.05  61.8
Soft Modularization            Params=8030K, TOPPO st...  2026.05  60.6
PCGrad-SAC                     Params=2031K, TOPPO st...  2026.05  45.8
Vanilla MT-PPO                 Params=716K, Backbone=...  2026.05  45.2
MT-MH-SAC                      Params=2031K, TOPPO st...  2026.05  31.9
PaCo                           Params=33909K, TOPPO s...  2026.05  18.6
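
The results above are reported as Success Rate (IQM). IQM is the interquartile mean: the average of the middle 50% of scores after the lowest and highest 25% are discarded. The page does not state how scores are pooled (for example across seeds or across tasks), so the following is only a minimal sketch of the statistic itself. The function name `interquartile_mean` and the sample values are hypothetical and are not taken from the leaderboard.

```python
from statistics import mean


def interquartile_mean(scores):
    """Return the mean of the middle 50% of scores.

    The lowest and highest 25% of values are dropped. The cut count is
    truncated to an integer, which is an assumption of this sketch.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    ordered = sorted(scores)
    cut = int(len(ordered) * 0.25)  # number of values dropped from each end
    return mean(ordered[cut:len(ordered) - cut])


# Hypothetical per-run success rates in percent (illustration only,
# NOT data from the table above).
runs = [10.0, 70.0, 75.0, 80.0, 80.0, 85.0, 90.0, 100.0]

print("plain mean:", mean(runs))
print("IQM:", interquartile_mean(runs))
```

In this made-up sample the single low run pulls the plain mean down, while the IQM ignores it, which is the usual motivation for reporting IQM when a few runs are outliers.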