Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Human Evaluation on A-OKVQA (test)
Loading...
7.83
Faithfulness Score
MMBoundary
4.0548
5.0349
6.015
6.9951
May 29, 2025
Faithfulness Score
Conciseness Score
Granularity Score
Average Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Faithfulness Score
Conciseness Score
Granularity Score
Average Score
MMBoundary
2025.05
7.83
7.25
8.18
7.75
SaySelf
2025.05
7.28
7.49
6.47
7.08
RCE
2025.05
6.73
6.58
7.41
6.9
DRL
2025.05
6.54
6.13
6.95
6.54
Conf-CSR
2025.05
6.47
5.73
5.82
6.01
Multisample
2025.05
4.2
5.17
4.06
4.47
Feedback
Search any
task
Search any
task