Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Model Editing on COUNTERFACT
Loading...
88.2
S
ROME
30.584
45.542
60.5
75.458
Feb 10, 2022
S
ES
EM
PS
PM
NS
NM
GE
RS
Updated 4d ago
Evaluation Results
Method
Method
Links
S
ES
EM
PS
PM
NS
NM
GE
RS
ROME
Base Model=GPT-2 Large...
2022.02
88.2
99.9
98.2
96.3
60.4
73.4
3.5
622.5
41.9
ROME
Base Model=GPT-2 Mediu...
2022.02
87.4
100
94.9
96.4
56.9
71.8
2.8
625
41.7
FT+L
Base Model=GPT-2 Large...
2022.02
71.2
100
96.3
63
5.1
61.5
1.1
625.2
39.3
FT+L
Base Model=GPT-2 Mediu...
2022.02
68
100
94.9
68.5
6.1
51.3
-1.7
626.1
39.3
GPT-2 M
Base Model=GPT-2 Mediu...
2022.02
33.4
25
-3.3
27.4
-3
74.9
3.6
625.8
31.4
GPT-2 L
Base Model=GPT-2 Large...
2022.02
32.8
23.9
-4
27.4
-3.5
75.7
4.3
625.4
31.8
Feedback
Search any
task
Search any
task