Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Pairwise Preference Prediction on RBM-EVAL-OOD Different Quality Trajectory Pairs
Loading...
100
Preference Accuracy (USC Franka)
ROBOMETER
50.184
63.117
76.05
88.983
Mar 2, 2026
Preference Accuracy (USC Franka)
Preference Accuracy (USC Koch)
Preference Accuracy (USC Trossen)
Preference Accuracy (USC xArm)
Preference Accuracy (MIT Franka)
Preference Accuracy (UTD; SO-101)
Average Preference Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Preference Accuracy (USC Franka)
Preference Accuracy (USC Koch)
Preference Accuracy (USC Trossen)
Preference Accuracy (USC xArm)
Preference Accuracy (MIT Franka)
Preference Accuracy (UTD; SO-101)
Average Preference Accuracy
ROBOMETER
Trajectory pair type=D...
2026.03
100
89.8
99
98.2
98.4
100
97.6
ROBOMETER
2026.03
75
79.4
76.2
88.9
85.4
90
82.5
RL-VLM-F
Trajectory pair type=D...
2026.03
70.7
54.7
64
73.3
55.3
72.7
65.1
RL-VLM-F
model=GPT-4o-mini
2026.03
52.1
54.4
66.7
48.6
54.4
56.7
55.5
Feedback
Search any
task
Search any
task