Reward modeling on EVAL_INSTRUCT 5 steps
[Chart: Step Completion Rate. Top entry: R2VLM, 3.38 (Mar 18, 2026). Available metrics: Step Completion Rate, Task Completion Rate.]
Updated 1mo ago
Evaluation Results
Method                                 Date      Step Completion Rate   Task Completion Rate
R2VLM                                  2026.03   3.38                   23
Pretrained SPRINT                      2026.03   3.31                   23
Step-Completion Based Reward           2026.03   2.77                   23
Qwen2.5-VL-Instruct (Model Size=7B)    2026.03   2.61                   23