Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Query-based video segmentation (Audio→Text) on AVSBench-S4 VGGSound-AVEL 90K

76.5mIoU

CoDAAR

74.73275.19175.6576.109May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
76.586.4
2026.05
76.486
2026.05
74.884.7