Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Speech-Visual Question Answering on VQA Speech-converted v2

32.95Accuracy

Single-Modality-Expert-Task3

1.10529.372617.6425.9074May 18, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.05
32.95
2024.05
32.6
2024.05
31.28
2024.05
30.86
2024.05
26.47
2024.05
26.01
2024.05
25.11
2024.05
24.3
2024.05
20.14
2024.05
12.68
2024.05
5.9
2024.05
2.33