| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| StrongReject | Identity-Robust Generation | Personalization Bias (PB)0.179 | 9 | 1mo ago | |
| Safety Prompts (randomly selected 200 samples per field) | llama2 -> CP -> FT + 0.5 chat vector | Insensitivity Score1.5 | 9 | 1mo ago | |
| Qualitative Assessment Dataset | Mi:dm 2.0-Base | Not Unsafe Rate (Content Safety)97.7 | 4 | 27d ago |