Ara-BEST-RQ: Multi Dialectal Arabic SSL

About

We present Ara-BEST-RQ, a family of self-supervised learning (SSL) models specifically designed for multi-dialectal Arabic speech processing. Leveraging 5,640 hours of crawled Creative Commons speech and combining it with publicly available datasets, we pre-train Conformer-based BEST-RQ models with up to 600M parameters. Our models are evaluated on dialect identification (DID) and automatic speech recognition (ASR) tasks, achieving state-of-the-art performance on DID while using fewer parameters than competing models. We demonstrate that family-targeted pre-training on Arabic dialects significantly improves downstream performance compared to multilingual models or monolingual models trained on non-Arabic data. All models, code, and pre-processed datasets will be publicly released to support reproducibility and further research in Arabic speech technologies.

Haroun Elleuch, Ryan Whetten, Salima Mdhaffar, Yannick Estève, Fethi Bougares • 2026

Related benchmarks

Task                          Dataset                          Metric    Result  Rank
Automatic Speech Recognition  TARIC-SLU (test)                 WER       21.14   6
Automatic Speech Recognition  Common Voice Arabic 19.0 (test)  WER       18.59   6
Automatic Speech Recognition  MGB-3 (test)                     WER       28.78   6
Automatic Speech Recognition  MGB-5 (test)                     WER       54.18   6
Dialect Identification        ADI-20 (val)                     Accuracy  97.21   4
Dialect Identification        ADI-20 (test)                    Accuracy  96.02   4
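
For readers unfamiliar with the ASR metric reported above, the following is a minimal sketch of how word error rate (WER) is conventionally computed: the word-level edit distance between a reference transcript and a system hypothesis, divided by the number of reference words. It assumes the scores in the table are percentages and that words are split on whitespace. Neither detail is stated in the source, and any text normalization the authors apply (for example, diacritic removal) is not reproduced here. The function name `word_error_rate` is illustrative only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage (assumption: whitespace tokenization)."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


# One substitution ("cat" -> "hat") and one deletion ("down"),
# over four reference words.
print(word_error_rate("the cat sat down", "the hat sat"))  # 50.0
```

Because the denominator is the reference length, WER can exceed 100 when the hypothesis contains many insertions. Lower is better for WER, while higher is better for the accuracy figures reported for dialect identification.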