Share your thoughts, 1 month free Claude Pro on usSee more

Robust Safety and Utility Evaluation in Federated Learning on MaliciousGen & WildChat

81.35Rule Score

Step-Level

Updated 4mo ago

Evaluation Results

Method	Links
Step-Level 2026.01		81.35	52.5	-1.71	3.98
Client-Level 2026.01		80	55.77	-1.87	4.13
Shadow-Level 2026.01		79.62	51.35	-1.78	4.12
TrimmedMean 2026.01		48.65	5.19	-3.48	3.81
FedAvg 2026.01		48.08	5.77	-3.59	3.82
Residual 2026.01		47.31	4.81	-3.49	4.12
FoolsGold 2026.01		46.73	5.96	-3.48	3.72
Krum 2026.01		46.15	5.01	-3.62	3.95