| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Automatic Speech Recognition | AMI | WER 9.89 | 28 |
| Automatic Speech Recognition | AMI (test) | Word Error Rate 11.2 | 24 |
| Meeting Summarization | AMI manual transcriptions | ROUGE-1 Recall (R) 42.43 | 22 |
| Speaker Diarization | AMI | DER 3.6 | 15 |
| Topic Segmentation | AMI (test) | F1 Score 30.34 | 10 |
| Automatic Speech Recognition | AMI IHM | WER 7.8 | 10 |
| Meeting Summarization | AMI | ROUGE-1 53.44 | 10 |
| Speaker Diarization | AMI MixHeadset | DER (%) 1.73 | 10 |
| Dialogue Summarization | AMI (test) | Conciseness 4.13 | 9 |
| Speaker Diarization | AMI Lapel | DER 1.99 | 8 |
| Automatic Speech Recognition | AMI SDM English (eval) | WER 17.7 | 8 |
| Multi-speaker Automatic Speech Recognition | AMI | CP-WER 32.53 | 7 |
| Multi-speaker Automatic Speech Recognition | AMI-IHM-Mix | WER 28.4 | 7 |
| Speech Recognition | AMI IHM (unlabeled) | WER 10.09 | 6 |
| Speech Recognition | AMI IHM (test) | WER (%) 9.35 | 6 |
| Joint ASR and Diarization | AMI SDM English (test) | Fail Rate 1.27 | 5 |
| Speaker Diarization | AMI (Eval) | DER (Mix-Headset) 0.0167 | 5 |
| Target-speaker Automatic Speech Recognition | AMI SDM | tcpWER 14.3 | 5 |
| Speaker Diarization | AMI Channel 1 | DER (%) 15.4 | 5 |
| Hierarchical Topic Segmentation | AMI (test) | Bhier 8.5 | 5 |
| Speaker-attributed Transcription | AMI MDM (eval) | cpWER 31.5 | 5 |
| Speaker-attributed Transcription | AMI-MDM (dev) | cpWER 30.7 | 5 |
| Target-speaker Automatic Speech Recognition | AMI IHM-Mix | tcpWER 11 | 4 |
| Meeting Summarization | AMI (test) | ROUGE-1 54.47 | 4 |
| Speaker Diarization | AMI (dev) | Diarization Error (Mix-Headset) 1.77 | 3 |