| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Speaker Verification | VoxCeleb1 with MUSAN noise (test) | EER1.91 | 187 | |
| Speaker Recognition | VoxCeleb1 (test) | EER2.21 | 126 | |
| Speaker Verification | VoxCeleb1 (Vox1-O) | EER0.14 | 105 | |
| Speaker Verification | VoxCeleb1 (test) | Cosine EER0.856 | 80 | |
| Speaker Verification | VoxCeleb1 (Vox1-H) | EER0.55 | 70 | |
| Speaker Verification | VoxCeleb-E | EER0.27 | 62 | |
| Speaker Identification | VoxCeleb1 | Accuracy97.5 | 58 | |
| Speaker Verification | VoxCeleb1 Hard Cleaned | EER0.0099 | 45 | |
| Speaker Verification | VoxCeleb1 Cleaned (Extended) | EER (%)0.48 | 45 | |
| Speaker Verification | VoxCeleb1 with Nonspeech100 (test) | EER (%)2.19 | 36 | |
| Speaker Verification | VoxCeleb1 extended (test) | EER1.07 | 25 | |
| Speaker verification | VoxCeleb 1 (verification) | EER0.48 | 22 | |
| Speaker Identification | VoxCeleb 1 | Normalized Score65.4 | 20 | |
| Speaker Verification | VoxCeleb1 hard (H) | EER1.82 | 17 | |
| Speaker Verification | VoxCeleb1 extended | EER1.04 | 17 | |
| Audio-visual speech separation | VoxCeleb2 (test) | SI-SNRi14.6 | 16 | |
| Face Reenactment | VoxCeleb2 (test) | FID24.92 | 16 | |
| Face Reenactment | VoxCeleb1 (test) | SSIM0.804 | 16 | |
| Speaker Verification | VoxCeleb Hard 1 | EER (f-f)1.87 | 15 | |
| Speaker Verification | VoxCeleb Extended 1 | EER (f-f)1 | 15 | |
| Speech-to-Portrait | VoxCeleb (test) | L1 Error25.24 | 14 | |
| Speaker Recognition | VoxCeleb1 original (vox1-o) | EER (mean)0.24 | 13 | |
| Cross-modal verification | VoxCeleb1 (Unseen-Unheard) | AUC85 | 13 | |
| Open-set speaker identification | VoxCeleb2 (test) | EER0.44 | 12 | |
| Speech Separation | VoxCeleb2-2Mix (test) | SDRi13.1 | 12 |