Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

About

A central challenge in large language model (LLM) editing is capability preservation: methods that successfully change targeted behavior can quietly game the editing proxy and corrupt general capabilities, producing degenerate behaviors reminiscent of proxy/reward hacking. We present CrispEdit, a scalable and principled second-order editing algorithm that treats capability preservation as an explicit constraint, unifying and generalizing several existing editing approaches. CrispEdit formulates editing as constrained optimization and enforces the constraint by projecting edit updates onto the low-curvature subspace of the capability-loss landscape. At the crux of CrispEdit is expressing capability constraint via Bregman divergence, whose quadratic form yields the Gauss-Newton Hessian exactly and even when the base model is not trained to convergence. We make this second-order procedure efficient at the LLM scale using Kronecker-factored approximate curvature (K-FAC) and a novel matrix-free projector that exploits Kronecker structure to avoid constructing massive projection matrices. Across standard model-editing benchmarks, CrispEdit achieves high edit success while keeping capability degradation below 1% on average across datasets, significantly improving over prior editors.

Zarif Ikram, Arad Firouzkouhi, Stephen Tu, Mahdi Soltanolkotabi, Paria Rashidinejad• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Math Score71
171
Commonsense ReasoningARC-C
Accuracy52
51
Model EditingWikiBigEdit
MMLU69.3
34
Model EditingzsRE
Reliability80.5
26
Model EditingCounterFact
Reliability79.4
26
Model EditingzsRE
Reliability0.774
16
Multi-task Language UnderstandingMMLU
MMLU Score68.5
14
Model EditingCounterFact 3,000 samples (test)
Reliability9.98e+3
13
Model EditingWikiBigEdit 3,000 samples (test)
Reliability99.9
13
Model EditingZsRE 3,000 samples (test)
Relational Score99.1
13
Showing 10 of 15 rows

Other info

GitHub

Follow for update