| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Reasoning | Text-Audio-Vision Benchmark Full Set | Pass@1 Accuracy69 | 3 | |
| Multimodal Reasoning | Text-Audio-Vision Benchmark Level 5 | Pass@1 Acc46 | 3 | |
| Multimodal Reasoning | Text-Audio-Vision Benchmark Level 4 | Pass@1 Accuracy61 | 3 | |
| Multimodal Reasoning | Text-Audio-Vision Benchmark Level 3 | Pass@1 Accuracy65 | 3 | |
| Multimodal Reasoning | Text-Audio-Vision Benchmark Level 2 | Pass@1 Accuracy86 | 3 | |
| Multimodal Reasoning | Text-Audio-Vision Benchmark Level 1 | Pass@1 Accuracy92 | 3 |