| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Audio temporal grounding | AudioGrounding | R1@.3 90.1 | 10 |
| Two-stage joint assessment | AudioGrounding | F1 Score 85.6 | 5 |
| Audio Event Presence Prediction | AudioGrounding | Positive Accuracy 93.4 | 5 |
| Text-to-audio Grounding | AudioGrounding TAG (evaluation) | PSDS2021 0.649 | 3 |