Vision-Language Reward Model Evaluation on MMRewardBench
Top accuracy: 83.6 (Selective LWE), Dec 7, 2025
Metrics: Accuracy, Consistency, Pairwise Accuracy
Updated 4d ago
Evaluation Results
| Method | Links | Date | Accuracy | Consistency | Pairwise Accuracy |
| Selective LWE | Relative Inference Cos... | 2025.12 | 83.6 | 94.7 | 80.8 |
| Majority Voting | Relative Inference Cos... | 2025.12 | 82.8 | 89.1 | 76.9 |
| TextGrad* | Relative Inference Cos... | 2025.12 | 82.1 | 83.6 | 74.1 |
| Sample-Specific Prompt | Relative Inference Cos... | 2025.12 | 81.5 | 86.5 | 74.2 |
| Dynamic Cheatsheet | Relative Inference Cos... | 2025.12 | 81.1 | 90.1 | 76.4 |
| Vanilla | Relative Inference Cos... | 2025.12 | 80.8 | 86.3 | 74.7 |
| CoT | Relative Inference Cos... | 2025.12 | 80.8 | 87.4 | 74.9 |
| LWE | Relative Inference Cos... | 2025.12 | 79.9 | 84.6 | 72.7 |