Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Speech-Visual Question Answering on VQA Speech-converted v2

32.95Accuracy

Single-Modality-Expert-Task3

1.10529.372617.6425.9074May 18, 2024
Updated 3d ago

Evaluation Results

MethodLinks
2024.05
32.95
2024.05
32.6
2024.05
31.28
2024.05
30.86
2024.05
26.47
2024.05
26.01
2024.05
25.11
2024.05
24.3
2024.05
20.14
2024.05
12.68
2024.05
5.9
2024.05
2.33