
Reward Model Evaluation on Meta-World (train)

Procedural Alignment Correlation (ρ): 0.97

FLORA

[Chart omitted. Axis ticks: -0.174, 0.123, 0.42, 0.717; date: May 21, 2026]
Updated 12d ago

Evaluation Results

Method (date)    ρ        (unnamed)    (unnamed)
2026.05          0.97     0.8          0.8
2026.05          0.89     0.53         0.16
2026.05          0.86     0.01         0.03
2026.05          0.78     0.42         0.27
2026.05          -0.13    -0.03        -0.01