| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Knowledge Editing | Counterfact | Efficacy9,387 | 362 | |
| Sequential model editing | Counterfact | Efficacy99.88 | 81 | |
| Sequential Model Editing | CounterFact T = 300 | Efficacy99.3 | 36 | |
| Subject inference attack | CounterFact batch-edit tasks | Recall100 | 36 | |
| Lifelong Model Editing | CounterFact | Efficacy73.06 | 33 | |
| Sequential Knowledge Editing | CounterFact sequential editing 10,000 Samples | Efficacy Success99.5 | 33 | |
| Knowledge Editing | Counterfact uns | Edit Success Rate94.56 | 30 | |
| Model Editing | CounterFact | Reliability92 | 30 | |
| Knowledge Editing | Counterfact 10,000 facts | Relational Score10,000 | 27 | |
| Knowledge Model Editing | CounterFact | Efficacy64.85 | 26 | |
| Model Editing | CounterFact | Reliability79.4 | 26 | |
| Model Editing | CounterFact | Efficacy96.12 | 24 | |
| Knowledge Editing | COUNTERFACT RS | Efficacy100 | 23 | |
| Classification Probing | Counterfact (test) | Probe Acc (Best Layer)89.6 | 21 | |
| Knowledge Editing | Counterfact Full (test) | Rel. Accuracy99 | 21 | |
| One-Time Edit | CounterFact | Efficacy99.88 | 20 | |
| Knowledge Editing | Counterfact | AVG Score92.96 | 20 | |
| Knowledge Editing | Counterfact (first 2000 edits) | Accuracy99.95 | 17 | |
| Sequential Knowledge Editing | CounterFact larger | Efficacy98.97 | 14 | |
| Sequential Knowledge Editing | CounterFact top | Efficacy93.87 | 14 | |
| Lifelong Knowledge Editing | COUNTERFACT | Reliability67.1 | 14 | |
| Sequential Model Editing | CounterFact T = 5000 | Efficacy96.6 | 13 | |
| Model Editing | CounterFact 3,000 samples (test) | Reliability9,980 | 13 | |
| Fact | CounterFact | Efficacy Score (%)76.54 | 12 | |
| Knowledge Editing | Counterfact (test) | RwA99.86 | 12 |