Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hallucination Assessment on RSHalluEval 1.0 (test)

0.9792HF Information Accuracy

Qwen2-VL

0.13420.3535750.572950.792325Feb 11, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
0.97920.95630.90220.83180.75940.8744
2026.02
0.91670.89380.75320.69750.47170.7286
2026.02
0.91670.94380.81370.6590.43870.7425
2026.02
0.9130.81250.75210.48530.80.6833
2026.02
0.89580.70.7520.36110.49060.6055
2026.02
0.85420.91250.78580.60340.46230.7084
2026.02
0.80560.83750.74040.62960.44340.6861
2026.02
0.750.54550.41670.3750.48330.4417
2026.02
0.72920.7250.59950.47530.41980.5601
2026.02
0.72730.86960.79460.48530.54170.67
2026.02
0.68060.9250.84280.7160.48110.7593
2026.02
0.67360.66250.78930.44750.45280.6263
2026.02
0.61810.78130.82420.58640.58020.7044
2026.02
0.61110.79410.74280.57450.68180.6783
2026.02
0.56250.31880.49940.30250.28770.4043
2026.02
0.3750.84620.55970.48570.53570.5317
2026.02
0.33330.5750.66010.35190.36790.5007
2026.02
0.27780.7750.82310.48460.55660.6441
2026.02
0.24310.30630.48780.27310.32550.3702
2026.02
0.16670.820.86610.5170.750.69
2026.02
0.16670.42110.43460.28980.42420.3633