| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | MaliciousGen & WildChat (test) | Rule Adherence97.69 | 8 | |
| Robust Safety and Utility Evaluation in Federated Learning | MaliciousGen & WildChat | Rule Score81.35 | 8 | |
| Safety and Utility Evaluation | MaliciousGen & WildChat | Rule Adherence97.69 | 3 |