Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-Vocabulary Audio-Visual Event Localization on OV-AVEBench (unseen)
Loading...
64.9
Accuracy
fine-tuning baseline
32.452
40.876
49.3
57.724
Nov 18, 2024
Accuracy
Segmentation Score
Event Localization Score
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Segmentation Score
Event Localization Score
Average Score
fine-tuning baseline
strategy=fine-tuning
2024.11
64.9
55
47.5
55.8
training-free baseline
strategy=training-free
2024.11
59.8
47.3
34
47
CLIP & CLAP
strategy=training-free
2024.11
51.6
42.2
31.6
41.8
Video-LLaMA2
strategy=training-free
2024.11
48.5
38.5
29
38.6
AVE
strategy=fine-tuning
2024.11
44.6
33.2
24
34
MM-Pyramid
strategy=fine-tuning
2024.11
36.8
29
23.8
29.9
CMRA
strategy=fine-tuning
2024.11
36
31
26.3
31.1
PSP
strategy=fine-tuning
2024.11
33.7
28.2
24.2
28.7
Feedback
Search any
task
Search any
task