| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LibriSpeech (test-clean) | WavSLM-2k | Speaker Similarity0.918 | 11 | 1mo ago | |
| DnD Group Gesture (test) | PolySLGen | BERT Score0.508 | 10 | 9d ago | |
| LibriSpeech | AG-REPA | WER3.45 | 8 | 1mo ago | |
| SALMon (human evaluation) | Flow-SLM | Sentiment Score3.86 | 8 | 1mo ago | |
| Accent+ | AUDIOBOX | JointCLAP0.596 | 5 | 1mo ago | |
| Expr | JointCLAP0.548 | 5 | 1mo ago | ||
| Long-Audio benchmark Chinese | Fish Audio S2 | CER5.95 | 4 | 1mo ago | |
| Long-Audio benchmark English | Fish Audio S2 | WER4.38 | 4 | 1mo ago | |
| ch2-sims v2 | WER1.17 | 4 | 1mo ago |