Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Robust Safety and Utility Evaluation in Federated Learning on MaliciousGen & LMSYS-Chat

92.5Rule Compliance

Shadow-Level

49.49660.660571.82582.9895Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
92.579.04-1.493.34
2026.01
91.1578.08-1.643.14
2026.01
90.9674.42-1.544.01
2026.01
61.7315.38-3.563.22
2026.01
52.8812.31-3.793.14
2026.01
52.8810.77-3.843.06
2026.01
52.1210.58-3.812.98
2026.01
51.1512.31-3.643.03