
Paired-prompt evaluation on NaturalBench

Simple Accuracy: 67.81

LLaVA-NeXT-Vicuna-7B

Jan 18, 2026
Updated 4d ago

Evaluation Results

Method  Links
2026.01  67.81  37.86  65.74
2026.01  65.17  31.76  77.17