Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Editing on LLaMA2-13B Sequential batch-editing setup
Loading...
84.7
S Score
SUIT
72.74
75.845
78.95
82.055
Sep 29, 2025
S Score
Effectiveness (Eff.)
Generation Score (Gen.)
Specificity Score (Spe.)
Updated 1mo ago
Evaluation Results
Method
Method
Links
S Score
Effectiveness (Eff.)
Generation Score (Gen.)
Specificity Score (Spe.)
SUIT
editing_setup=5 batche...
2025.09
84.7
95
79.1
81.7
AlphaEdit
editing_setup=5 batche...
2025.09
79.3
98.4
76.5
68.4
MEMIT
editing_setup=5 batche...
2025.09
73.2
90
68.3
65.6
Feedback
Search any
task
Search any
task