| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WMDP bio | Accuracy71.2 | 42 | 1mo ago | ||
| WMDP cyber | Accuracy47.21 | 38 | 1mo ago | ||
| RWKU (Forget Set) | FB85.9 | 23 | 2d ago | ||
| 16-task Sequential Unlearning Forgotten Data Avg | CORE | CRR90.9 | 18 | 25d ago | |
| GONE FB Templates Wikidata | Direct Success Rate99.6 | 18 | 1mo ago | ||
| GONE Wikidata (QA Templates) | Direct Success Rate100 | 18 | 1mo ago | ||
| 16-task Sequential Unlearning Forgotten Data Last | LwF | Context-aware Refusal Rate (CRR)41.01 | 16 | 25d ago | |
| FaithUn | FS91.92 | 16 | 1mo ago | ||
| The Pile 32 sample (val) | NEO + DPD+ | EL10 (%)0 | 15 | 1mo ago | |
| Internal e-commerce benchmark medium-scale seller 387 items (Forget Set) | ME+GD | ROUGE89.4 | 14 | 1mo ago | |
| WMDP Bio (test) | Refusal Training | Accuracy Forget64.81 | 11 | 1mo ago | |
| MUSE (forget set Df) | VerbMem Df Pre57.9 | 8 | 2d ago | ||
| RWKU Utility Set | Fac Score58.2 | 6 | 1mo ago | ||
| RWKU MIA Set | DPO | FM228 | 6 | 1mo ago | |
| RWKU (Neighbor Set) | FB Score95.6 | 6 | 1mo ago | ||
| MMLU Target Subjects | ROKA | mEM52.19 | 4 | 1mo ago | |
| RWKU | ICU | FB47.5 | 3 | 1mo ago | |
| MMLU Non-Target Subjects | ROKA | mEM26.82 | 2 | 1mo ago | |
| MMLU All Subjects | ROKA | Exact Match (EM)49.57 | 2 | 1mo ago |