| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LibriSpeech (test-clean) | WavSLM-2k | Speaker Similarity0.918 | 11 | 2mo ago | |
| DnD Group Gesture (test) | PolySLGen | BERT Score0.508 | 10 | 1mo ago | |
| LibriSpeech | AG-REPA | WER3.45 | 8 | 3mo ago | |
| SALMon (human evaluation) | Flow-SLM | Sentiment Score3.86 | 8 | 3mo ago | |
| Seed | Raon-Speech | WER1.93 | 7 | 8d ago | |
| LibriSpeech clean | Raon-Speech | WER2.01 | 7 | 8d ago | |
| CV3-Eval ko | Raon-Speech | CER3.9 | 7 | 8d ago | |
| MiniMax ko | Raon-Speech | CER1.57 | 7 | 8d ago | |
| KSponSpeech-c | Raon-Speech | CER4.89 | 7 | 8d ago | |
| Accent+ | AUDIOBOX | JointCLAP0.596 | 5 | 3mo ago | |
| Expr | JointCLAP0.548 | 5 | 3mo ago | ||
| Long-Audio benchmark Chinese | Fish Audio S2 | CER5.95 | 4 | 2mo ago | |
| Long-Audio benchmark English | Fish Audio S2 | WER4.38 | 4 | 2mo ago | |
| ch2-sims v2 | WER1.17 | 4 | 2mo ago | ||
| SeedTTS ZH (test) | MiniCPM-o 4.5 | CER0.86 | 3 | 1mo ago | |
| Video A | Recursive Narrative Bank | Proportion96.67 | 1 | 14d ago |