| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Safety Evaluation | MaliciousGen & LMSYS-Chat (test) | Rule Score: 97.31 | 8 |
| Robust Safety and Utility Evaluation in Federated Learning | MaliciousGen & LMSYS-Chat | Rule Compliance: 92.5 | 8 |
| Safety and Utility Evaluation | MaliciousGen & LMSYS-Chat | Rule Score: 97.31 | 3 |