Our new X account is live! Follow @wizwand_team for updates
SOTA Audio-Visual Question Answering benchmarks and papers with code | Wizwand
Audio-Visual Question Answering
Benchmarks
| Dataset Name | SOTA Method | Metric | Results | Last Updated |
| --- | --- | --- | --- | --- |
| MUSIC-AVQA 1.0 (test) | Sparsify | AV Localis Accuracy: 85.09 | 96 | 4d ago |
| MUSIC-AVQA (test) | VAST | Acc (Avg): 80.7 | 59 | 4d ago |
| Music-AVQA | VideoLLaMA2 | Accuracy: 81.3 | 21 | 4d ago |
| video-SALMONN 2 (test) | Full Tokens | Miss Rate: 29.1 | 18 | 4d ago |
| OmniVideoBench | Full Tokens | Accuracy: 0.356 | 18 | 4d ago |
| WorldSense | OmniSIFT | Accuracy: 50 | 18 | 4d ago |
| MUSIC-AVQA Bias v2.0 (test) | SHRIKE | Total Accuracy: 77.33 | 18 | 4d ago |
| MUSIC-AVQA balanced v2.0 (test) | LAST-Att | Total Accuracy: 75.44 | 18 | 4d ago |
| AVQA | CAT-7B | Accuracy: 92 | 14 | 4d ago |
| AVQA (test) | JavisGPT | Total Accuracy: 93.8 | 13 | 4d ago |
| MUSIC-AVQA-R (test) | QA-TIGER | Audio QA Count (Head): 82.67 | 13 | 4d ago |
| VALOR (test) | M3KG-RAG | M.J. Score: 44.67 | 12 | 4d ago |
| AVQA (val) | MEERKAT | Existence Accuracy: 88.24 | 9 | 4d ago |
| MUSIC-AVQA balanced (test) | MEERKAT | Existential Score: 83.62 | 8 | 4d ago |
| Music-AVQA 2000 samples | Combined Loss | ASR Rate: 13.8 | 7 | 4d ago |
| AVQA (subset 2000 samples) | Combined Loss | ASR Accuracy: 96.03 | 7 | 4d ago |
| Music-AVQA | Negative Language Modeling Loss | Music-AVQA Clean Accuracy: 80.7 | 7 | 4d ago |
| AVQA | Negative Language Modeling Loss | AVQA Clean Accuracy: 95.6 | 7 | 4d ago |
| Music-AVQA 30 (test) | CAT-7B-FT | Overall Accuracy: 84.3 | 7 | 4d ago |
| AVSD 1 (test) | PAVE-7B (w/ audio) | CIDEr: 152.9 | 6 | 4d ago |
| AVSD | LongVALE-LLM | Accuracy: 54.8 | 6 | 4d ago |
| VGGSound | Mirasol3B | Accuracy: 69.8 | 6 | 4d ago |
| AVSD (test) | COST | CIDEr: 108.5 | 6 | 4d ago |
| AVQA 69 (test) | PAVE-7B (w/ audio) | Accuracy: 93.8 | 5 | 4d ago |
| VALOR (test) | VAST | CIDEr: 62.2 | 5 | 4d ago |
Showing 25 of 28 rows