| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Speech-to-speech translation | Fisher Spanish-English (test) | BLEU (Speech Input)90.5 | 55 | |
| Speech-to-speech translation | Fisher Spanish-English (dev) | BLEU (Speech)88.5 | 48 | |
| Speech-to-speech translation | Fisher Spanish-English (dev2) | ASR BLEU89.4 | 36 | |
| Speech-to-Speech Translation | Fisher Es→En (test) | ASR chrF70.2 | 10 | |
| Speech-to-Speech Translation | Fisher Es→En (dev) | ASR chrF69.5 | 10 | |
| Speaker-Attributed Automatic Speech Recognition | Fisher (test) | WDER0.9 | 4 | |
| ES-to-EN AST | Fisher (test) | BLEU64.7 | 4 | |
| Speaker-Attributed Automatic Speech Recognition | Fisher Global Meeting-level | DER15.21 | 4 | |
| Speaker-Attributed Automatic Speech Recognition | Fisher (local setting) | DER8.18 | 4 | |
| Fine-grained Score Accuracy | Fisher | Exact Accuracy64.76 | 1 | |
| Binary classification (Human vs Machine speech) | Fisher (Human-Human) OOD (test) | Accuracy98.44 | 1 |