Sequential Knowledge Editing on MQuAKE
Best Efficacy: 97.4 (D4S), Oct 31, 2024
[Chart: Efficacy, Paraphrase Score, Specificity, Average Score]
Evaluation Results
Method  Model  Date     Efficacy  Paraphrase Score  Specificity  Average Score
D4S     GPT    2024.10  97.4      75.54             21.84        64.93
D4S     Llama  2024.10  85.3      72.68             28.16        62.05
ROME    Llama  2024.10  76.85     78.41             3.67         52.97
FT      Llama  2024.10  41.43     44.93             28.24        38.2
FT      GPT    2024.10  17        6.22              0            7.74
ROME    GPT    2024.10  0         0                 0            0
MEMIT   GPT    2024.10  0         0                 0            0
MEMIT   Llama  2024.10  0         0                 0            0