Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

About

Speech processing Universal PERformance Benchmark (SUPERB) is a leaderboard to benchmark the performance of Self-Supervised Learning (SSL) models on various speech processing tasks. However, SUPERB largely considers English speech in its evaluation. This paper presents multilingual SUPERB (ML-SUPERB), covering 143 languages (ranging from high-resource to endangered), and considering both automatic speech recognition and language identification. Following the concept of SUPERB, ML-SUPERB utilizes frozen SSL features and employs a simple framework for multilingual tasks by learning a shallow downstream model. Similar to the SUPERB benchmark, we find speech SSL models can significantly improve performance compared to FBANK features. Furthermore, we find that multilingual models do not always perform better than their monolingual counterparts. We will release ML-SUPERB as a challenge with organized datasets and reproducible training scripts for future multilingual representation research.

Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe• 2023

Related benchmarks

TaskDatasetResultRank
Automatic Speech RecognitionML-SUPERB 10-min Normal
CER29
26
Language IdentificationML-SUPERB 10-min Normal
LID Accuracy89.1
18
Automatic Speech Recognition10-min ML-SUPERB Few-shots
ASR CER39
12
Language IdentificationML-SUPERB 1hr Normal
Accuracy90.9
10
Automatic Speech RecognitionML-SUPERB 1hr Normal
CER22.7
10
Speaker VerificationVoxCeleb 10min context Normal
EER1.29
10
Speaker VerificationVoxCeleb 1hr context Normal
EER0.0129
10
Speaker VerificationVoxCeleb
EER1.29
8
Automatic Speech RecognitionML-SUPERB 10-min Few-shots 1.0
ASR CER39
4
Language IdentificationML-SUPERB 10-min Few-shots
LID Acc83.9
4
Showing 10 of 11 rows

Other info

Follow for update