Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Paired-prompt evaluation on HallusionBench

52.89Simple Accuracy

LLaVA-NeXT-Vicuna-7B

47.419648.839850.2651.6802Jan 18, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
52.8916.7461.51
2026.01
47.6312.3387.17