| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-to-Speech | MiniMax Multilingual 24 (test) | WER0.572 | 75 | |
| Persona Discrimination | MiniMax Cross-generator M2.5 | Persona Separability (Δ)0.281 | 16 | |
| Speech Generation | MiniMax ko | CER1.57 | 7 | |
| Speech Synthesis | Minimax Multilingual (test) | Arabic WER (%)1.665 | 4 |