Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Editing on ZsRE N = 5000
Loading...
100
Reliability
GRACE
8.48
32.24
56
79.76
May 2, 2026
Reliability
Generalization
Locality
OP
Updated 22d ago
Evaluation Results
Method
Method
Links
Reliability
Generalization
Locality
OP
GRACE
Model=Qwen2.5 7B
2026.05
100
3
100
31
GRACE
Model=DeepSeek 8B
2026.05
100
2
100
27
HoReN
Model=DeepSeek 8B
2026.05
100
94
97
97
HoReN
Model=SEA-LION 32B
2026.05
100
90
97
96
HoReN
Model=Qwen2.5 7B
2026.05
99
89
100
96
HoReN
Model=GPT-OSS 20B
2026.05
99
67
99
87
GRACE
Model=GPT-OSS 20B
2026.05
60
2
100
23
WISE
Model=SEA-LION 32B
2026.05
58
55
100
68
GRACE
Model=SEA-LION 32B
2026.05
43
2
100
20
WISE
Model=DeepSeek 8B
2026.05
36
33
100
49
WISE
Model=Qwen2.5 7B
2026.05
24
22
79
35
WISE
Model=GPT-OSS 20B
2026.05
12
11
12
12
Feedback
Search any
task
Search any
task