| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| ASR | JSUT basic5000 | CER7.1 | 15 | |
| Text-to-Speech | JSUT JA (test) | CER0.13 | 5 | |
| Speech Restoration | JSUT JP (test) | DNSMOS3.57 | 5 | |
| Pronunciation Accuracy | JSUT (test) | Phoneme Error Rate (S)0.92 | 3 | |
| Automatic Speech Recognition | JSUT (our split) | CER18.7 | 2 |