Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reward Model Evaluation on Meta-World Position OOD
Loading...
0.85
Process Alignment ρ
FLORA
0.1012
0.2956
0.49
0.6844
May 21, 2026
Process Alignment ρ
Reward Rank
Reward Difference
Updated 12d ago
Evaluation Results
Method
Method
Links
Process Alignment ρ
Reward Rank
Reward Difference
FLORA
2026.05
0.85
0.57
0.46
VLC
2026.05
0.65
0.07
0.1
ReWiND-CT
2026.05
0.61
0.4
0.03
LIV-FT
Fine-tuned=true
2026.05
0.55
0.39
0.17
LIV
Fine-tuned=false
2026.05
0.13
-0.08
-0.01
Feedback
Search any
task
Search any
task