| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio-visual speech-to-text translation | MuAViC (test) | BLEU (EL->EN)21.7 | 23 | |
| Speech Recognition | MuAViC (test) | Arabic Score197.9 | 9 | |
| Audio-visual speech recognition | MuAViC Noise environment (test) | Accuracy (En)49.5 | 9 | |
| Audio-visual speech recognition | MuAViC Clean environment (test) | En Acc2.5 | 9 | |
| Speech-to-Speech Translation | MuAViC English-to-X | ASR-BLEU (ES)30.15 | 8 | |
| Speech-to-Speech Translation | MuAViC X-to-English | ASR-BLEU (Es->En)28.7 | 8 | |
| Audio-Visual Speech Recognition | MuAViC (test) | Accuracy (Ara)89.4 | 7 | |
| Visual Speech Translation | MuAViC | En->It Score17.9 | 6 | |
| Speech Recognition | MuAViC v1 (test) | WER (Ar)- | 0 |