| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| URO-Bench Chinese Basic Track | Step-Audio 2 mini | Repeat Score99.83 | 15 | 1mo ago | |
| URO-Bench Basic Track | GLM-4-Voice | Understanding Accuracy85.82 | 12 | 24d ago | |
| MultiDialog 1.0 (test) | SpeechGPT | PPL930.401 | 8 | 1mo ago | |
| Big Bench Audio (test) | MiMo-Audio-7B-Instruct | S2T Accuracy72.9 | 6 | 1mo ago | |
| MultiChallenge Audio (test) | MiMo-Audio-7B-Instruct | S2T Score15.15 | 5 | 1mo ago |