Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Paired-prompt evaluation on BEAF (sample)
Loading...
90.67
Simple Accuracy
LLaVA-NeXT-Vicuna-7B
89.9004
90.1002
90.3
90.4998
Jan 18, 2026
Simple Accuracy
Paired Accuracy
Yes-Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Simple Accuracy
Paired Accuracy
Yes-Rate
LLaVA-NeXT-Vicuna-7B
Intervention=Q4 system...
2026.01
90.67
87.72
31.5
LLaVA-NeXT-Vicuna-7B
Intervention=No-interv...
2026.01
89.93
86.79
35.34
Feedback
Search any
task
Search any
task