Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Paired-prompt evaluation on SugarCrepe
Loading...
64.56
Simple Accuracy
LLaVA-NeXT-Vicuna-7B
61.7624
62.4887
63.215
63.9413
Jan 18, 2026
Simple Accuracy
Paired Accuracy
Yes Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Simple Accuracy
Paired Accuracy
Yes Rate
LLaVA-NeXT-Vicuna-7B
Intervention=Q4 system...
2026.01
64.56
29.23
83.24
LLaVA-NeXT-Vicuna-7B
Intervention=No-interv...
2026.01
61.87
23.85
86.81
Feedback
Search any
task
Search any
task