| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Automatic Speech Recognition | AMI | WER8.43 | 35 | |
| Speaker Diarization | AMI | DER3.6 | 24 | |
| Automatic Speech Recognition | AMI (test) | Word Error Rate11.2 | 24 | |
| Meeting Summarization | AMI manual transcriptions | ROUGE-1 Recall (R)42.43 | 22 | |
| Context-aware turn-taking | AMI | I1 Score92.41 | 16 | |
| Multi-speaker Automatic Speech Recognition | AMI IHM (test) | cpWER16.4 | 12 | |
| Automatic Speech Recognition | AMI IHM | WER7.8 | 12 | |
| Multi-speaker Automatic Speech Recognition | AMI | CP-WER24.62 | 11 | |
| Multi-speaker Automatic Speech Recognition | AMI SDM (test) | DER13.43 | 10 | |
| Topic Segmentation | AMI (test) | F1 Score30.34 | 10 | |
| Meeting Summarization | AMI | ROUGE-153.44 | 10 | |
| Speaker Diarization | AMI MixHeadset | DER (%)1.73 | 10 | |
| Speaker Diarization | AMI 30s | DER21.3 | 9 | |
| Dialogue Summarization | AMI (test) | Conciseness4.13 | 9 | |
| Speaker Diarization | AMI Lapel | DER1.99 | 8 | |
| Automatic Speech Recognition | AMI SDM English (eval) | WER17.7 | 8 | |
| Speaker-attributed Automatic Speech Recognition | AMI SDM | WER17.49 | 7 | |
| Multi-speaker Automatic Speech Recognition | AMI-IHM-Mix | WER28.4 | 7 | |
| Speech Recognition and Diarization | AMI IHM | WER19.21 | 6 | |
| Speech Recognition with Timestamps | AMI | AAS64.8 | 6 | |
| Speech Recognition | AMI IHM (unlabeled) | WER10.09 | 6 | |
| Speech Recognition | AMI IHM (test) | WER (%)9.35 | 6 | |
| Keyword Spotting | AMI | FAR0.007 | 5 | |
| Joint ASR and Diarization | AMI SDM English (test) | Fail Rate1.27 | 5 | |
| Speaker Diarization | AMI (Eval) | DER (Mix-Headset)0.0167 | 5 |