| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio-visual segmentation | AVSS V2 | MJ Score67.7 | 9 | |
| Audio-Visual Semantic Segmentation | AVSS | MJ Score31.9 | 7 | |
| Audio-Visual Segmentation | AVSS Binary | mIoU (MJ)65.9 | 6 | |
| Audio-Visual Semantic Segmentation | AVSS (test) | mIoU37.4 | 4 |