Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia

About

Culturally aware safeguards are crucial for AI alignment in real-world settings, where safety extends beyond common sense and encompasses diverse local values, norms, and region-specific regulations. However, building large-scale, culturally grounded datasets is challenging due to limited resources and a scarcity of native annotators. Consequently, many safeguard models rely on machine translation of English datasets, often missing regional and cultural nuances. We present a novel agentic data-generation framework to scalably create authentic, region-specific safety datasets for Southeast Asia (SEA). On this foundation, we introduce the SEA-Guard family, the first multilingual safeguard models grounded in SEA cultural contexts. Evaluated across multiple benchmarks and cultural variants, SEA-Guard consistently outperforms existing safeguards at detecting regionally sensitive or harmful content while maintaining strong general safety performance.

Panuthep Tasawong, Jian Gang Ngui, Alham Fikri Aji, Trevor Cohn, Peerat Limkonchotiwat• 2026

Related benchmarks

TaskDatasetResultRank
Prompt ClassificationSEA-SafeguardBench
AUPRC (Average)93.6
29
Prompt ClassificationSEA-SafeguardBench English
AUPRC98.9
18
Response ClassificationSEA-SafeguardBench CG Cultural
AUPRC (English)75.4
16
Prompt ClassificationSEALS (SEA)
AUPRC96.9
9
Vision-text safety classificationVSCBench
AUPRC72.65
9
Vision-text safety classificationVLGuard
AUPRC (Prompt)0.8843
9
Vision-text safety classificationMSSBench Chat
AUPRC (Prompt)52.07
9
Vision-text safety classificationMSSBench Embodied
AUPRC (Prompt)61.97
9
Response ClassificationSafeQA English
AUPRC97.5
9
Response ClassificationSEA-SafeguardBench
AUPRC89.4
9
Showing 10 of 11 rows

Other info

Follow for update