Paired-prompt evaluation on BEAF (sample)

90.67 Simple Accuracy

LLaVA-NeXT-Vicuna-7B

[Chart: y-axis ticks 89.9004, 90.1002, 90.3, 90.4998; x-axis date Jan 18, 2026]
Updated 1mo ago

Evaluation Results

Method | Links
2026.01 | 90.67 | 87.72 | 31.5
2026.01 | 89.93 | 86.79 | 35.34