AVSD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Video Dialogue | AVSD DSTC8 (test) | BLEU-4 47.5 | 24 |
| Audio-Visual Scene-Aware Dialog | AVSD (test) | CIDEr 1.605 | 11 |
| Audio-Video Understanding | AVSD (test) | Accuracy 62.8 | 9 |
| Audio-Visual Scene-aware Dialog | AVSD (val) | ASR (%) 59.48 | 7 |
| Open-Ended Audio-Video QA | AVSD | Accuracy 57.2 | 7 |
| Audio-Visual Question Answering | AVSD 1 (test) | CIDEr 152.9 | 6 |
| Audio-Visual Question Answering | AVSD | Accuracy 54.8 | 6 |
| Video Dialogue | AVSD DSTC7 (test) | BLEU-1 78.9 | 6 |
| Video Dialogue | AVSD DSTC10 (test) | CIDEr 103.3 | 6 |
| Video Dialog | AVSD DSTC7 | BLEU-1 55.5 | 6 |
| Video Dialog | AVSD DSTC10 | BLEU-1 0.546 | 6 |
| Audio-Visual Question Answering | AVSD (test) | CIDEr 108.5 | 6 |
| Response Generation | AVSD | CIDEr 85.1 | 4 |