Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reward Model Evaluation on Meta-World Viewpoint OOD
Loading...
0.88
Process Alignment ρ
FLORA
0.048
0.264
0.48
0.696
May 21, 2026
Process Alignment ρ
Reward Ranking
Reward Difference
Updated 12d ago
Evaluation Results
Method
Method
Links
Process Alignment ρ
Reward Ranking
Reward Difference
FLORA
2026.05
0.88
0.67
0.46
VLC
2026.05
0.75
0.1
0.08
LIV-FT
Fine-tuned=true
2026.05
0.69
0.38
0.12
ReWiND-CT
2026.05
0.65
0.41
0.03
LIV
Fine-tuned=false
2026.05
0.08
0.07
0.01
Feedback
Search any
task
Search any
task