Audio-visual video parsing on LLP (val)
[Chart: Segment-Level Audio Accuracy over time. Highlighted result: BL+RegBN, 63.5. Date axis: Oct 1, 2023.]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Segment-Level Audio Accuracy | Segment-Level Visual Accuracy | Segment-Level Audio-Visual Accuracy | Segment-Level Type Accuracy | Segment-Level Event Accuracy | Event-Level Audio Accuracy | Event-Level Visual Accuracy | Event-Level Audio-Visual Accuracy | Event-Level Type Accuracy | Event-Level Event Accuracy |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BL+RegBN | Backbone=AVVP, Normali... | 2023.10 | 63.5 | 55.3 | 49.1 | 55 | 58 | 52.5 | 51.1 | 44 | 49 | 50.9 |
| BL | Backbone=AVVP, Learnab... | 2023.10 | 61.8 | 54.5 | 49 | 55.1 | 57.4 | 53.6 | 49.9 | 43.3 | 49.4 | 49.8 |
| BL+PMDN | Backbone=AVVP, Normali... | 2023.10 | 61.4 | 54.6 | 48.9 | 54.8 | 57.4 | 52.8 | 50.5 | 43.3 | 49.3 | 50.2 |