| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio Modeling | Counting 7s v1 (test) | SNR33.58 | 49 | |
| Visual perception | Counting | Accuracy68.33 | 18 | |
| Multimodal Reasoning | Counting | Accuracy68.33 | 12 | |
| counting | counting | Accuracy95.9 | 7 | |
| Visual Reasoning | Counting | Avg@870.75 | 3 |