| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | BeaverTails & LMSYS-Chat (test) | Rule Score97.88 | 8 | |
| Robust Safety and Utility Evaluation in Federated Learning | BeaverTails & LMSYS-Chat | Rule Score91.92 | 8 | |
| Safety and Utility Evaluation | BeaverTails & LMSYS-Chat | Rule Score97.88 | 3 |