Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Audio Visual Question Answering on Music-AVQA (Robustness Evaluation)
Loading...
80.7
Music-AVQA Clean Accuracy
Negative Language Modeling Loss
76.665
78.6825
80.7
82.7175
Jan 20, 2026
Music-AVQA Clean Accuracy
Music-AVQA Attack Accuracy
Music-AVQA Accuracy Drop
Updated 4d ago
Evaluation Results
Method
Method
Links
Music-AVQA Clean Accuracy
Music-AVQA Attack Accuracy
Music-AVQA Accuracy Drop
Negative Language Modeling Loss
Objective=L_negLM
2026.01
80.7
74.3
6.4
Encoder-Based Cosine Similarity Loss
Objective=L^(cos)
2026.01
80.7
74.7
6
Vision Attention Suppression Loss
Objective=L^(visionatt)
2026.01
80.7
77.7
3
Audio Attention Amplification Loss
Objective=L^(audioatt)
2026.01
80.7
76.9
3.8
Attention Randomization Loss
Objective=L^(randatt)
2026.01
80.7
75.8
4.9
Hidden-State Similarity Loss
Objective=L^(hidden-cos)
2026.01
80.7
76.6
4.1
Combined Loss (SOUNDBREAK)
Objective=L^(combined)
2026.01
80.7
75
5.7
Feedback
Search any
task
Search any
task