MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization
About
We present a systematic study of multilingual polarization detection across 22 languages for SemEval-2026 Task 9 (Subtask 1), contrasting multilingual generalists with language-specific specialists and hybrid ensembles. While a standard generalist like XLM-RoBERTa suffices when its tokenizer aligns with the target text, it may struggle with distinct scripts (e.g., Khmer, Odia) where monolingual specialists yield significant gains. Rather than enforcing a single universal architecture, we adopt a language-adaptive framework that switches between multilingual generalists, language-specific specialists, and hybrid ensembles based on development performance. Additionally, cross-lingual augmentation via NLLB-200 yielded mixed results, often underperforming native architecture selection and degrading morphologically rich tracks. Our final system achieves an overall macro-averaged F1 score of 0.796 and an average accuracy of 0.826 across all 22 tracks. Code and final test predictions are publicly available at: https://github.com/Maziarkiani/SemEval2026-Task9-Subtask1-Polarization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Polarization detection | SemEval Task 9 2026 (test) | Macro-F188.92 | 17 | |
| Polarization Analysis | SemEval Task 9 (amh) 2026 (test) | Accuracy82.4 | 1 | |
| Polarization Analysis | SemEval Task 9 arb 2026 (test) | Accuracy83.2 | 1 | |
| Polarization Analysis | SemEval Task 9 ben 2026 (test) | Accuracy84.7 | 1 | |
| Polarization Analysis | SemEval Task 9 mya 2026 (test) | Accuracy88.9 | 1 | |
| Polarization Analysis | SemEval-2026 Task 9 zho (test) | Accuracy89.1 | 1 | |
| Polarization Analysis | SemEval Task 9 eng 2026 (test) | Accuracy81 | 1 | |
| Polarization Analysis | SemEval Task 9 deu 2026 (test) | Accuracy71.1 | 1 | |
| Polarization Analysis | SemEval Task 9 hau 2026 (test) | Accuracy93.1 | 1 | |
| Polarization Analysis | SemEval-2026 Task 9 hin (test) | Accuracy89.6 | 1 |