| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Instruction Following | MQUAKE | Accuracy | 82.5 | 24 |
| Knowledge Editing | MQuAKE-Story 1.0 (test) | Fact Accuracy (Easy) | 100 | 14 |
| Knowledge Editing | MQuAKE Story | Fact Accuracy (Easy) | 100 | 14 |
| Knowledge Editing | MQuAKE-CF 1.0 (test) | Fact Accuracy (Easy) | 99.9 | 14 |
| Multi-hop Knowledge Editing | MQUAKE-T (All edited) | Accuracy | 78.16 | 12 |
| Multi-hop Knowledge Editing | MQUAKE-T (1 edited) | Accuracy | 97.7 | 12 |
| Multi-hop Knowledge Editing | MQUAKE-CF-3K (100 edited) | Accuracy | 56 | 12 |
| Multi-hop Knowledge Editing | MQUAKE-CF-3K (1 edited) | Accuracy | 67.27 | 12 |
| Multi-hop Question Answering | MQuAKE | MHQ Accuracy | 31.6 | 10 |
| Multi-hop Knowledge Editing | MQUAKE-CF-3K All edited | Accuracy | 45.87 | 10 |
| Knowledge Editing | MQUAKE | Average Accuracy | 0.7589 | 8 |
| Sequential Knowledge Editing | MQuAKE | Efficacy | 97.4 | 8 |
| Multi-hop Knowledge Editing | MQuAKE CF v2 | 2-hop Score | 69.1 | 6 |