Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Human Evaluation on A-OKVQA (test)

7.83Faithfulness Score

MMBoundary

4.05485.03496.0156.9951May 29, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.05
7.837.258.187.75
2025.05
7.287.496.477.08
2025.05
6.736.587.416.9
2025.05
6.546.136.956.54
2025.05
6.475.735.826.01
4.25.174.064.47