Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Robust Safety and Utility Evaluation in Federated Learning on BeaverTails & LMSYS-Chat

91.92Rule Score

Shadow-Level

49.924860.827471.7382.6326Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
91.9277.12-1.583.2
2026.01
88.8576.35-1.733.07
2026.01
84.4265.96-2.013.05
2026.01
60.1920.58-3.522.96
2026.01
53.2718.08-3.843.15
2026.01
53.0812.12-4.013.14
2026.01
51.7314.81-3.973.18
2026.01
51.5411.35-3.973.16