Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hallucination Evaluation on Object HalBench (full benchmark)

25.2Ha (Living Room)

LLaVA-1.5

4.71210.03115.3520.669Dec 1, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.12
25.241.816.618.923.9522.430.4820.6287.49.2
2023.12
24.534.51016.420.84.421.617.5-4.122.5329.55
2023.12
23.734.510.813.117.44.318.219.51.418.322.74.45.2
2023.12
8.219.411.24.65.71.15.913.37.54.24.60.45
2023.12
5.582.53.85.92.14.14-0.12.34.62.31.7