Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Safety and Informativeness Evaluation on AdvBench

97.2Safety Rate

SafeMoE-XL

8.48831.51954.5577.581May 30, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
97.28.2
2026.05
80.56.9
2026.05
73.27.8
60.85.8
2026.05
55.26.9
2026.05
48.96.7
2026.05
44.57.2
2026.05
31.14.4
27.76.7
2026.05
25.15.9
2026.05
11.94.6