Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Paired-prompt evaluation on BEAF (sample)

90.67Simple Accuracy

LLaVA-NeXT-Vicuna-7B

89.900490.100290.390.4998Jan 18, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
90.6787.7231.5
2026.01
89.9386.7935.34