
POME: Post Optimization Model Edit via Muon-style Projection

About

We introduce Post-Optimization Model Edit (POME), a new algorithm that enhances the performance of fine-tuned large language models using only their pretrained and fine-tuned checkpoints, without requiring extra data or further optimization. The core idea is to apply a Muon-style projection to $\Delta W$, the difference between the fine-tuned and pretrained weights. This projection uses truncated singular value decomposition (SVD) to equalize the influence of the dominant update directions and to prune small singular values, which often represent noise. As a simple post-processing step, POME is completely decoupled from the training pipeline: it requires zero modifications and imposes no overhead, making it universally compatible with any optimizer or distributed framework. POME delivers consistent gains, boosting average performance by +2.5% on GSM8K and +1.0% on code generation. Its broad applicability -- from 7B foundation models to 72B RLHF-instructed models -- establishes it as a practical, zero-cost enhancement for any fine-tuning pipeline. Code is available at https://github.com/NUS-HPC-AI-Lab/POME.
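The projection described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name `pome_edit`, the fixed truncation `rank`, and the Frobenius-norm rescaling of the projected delta are all assumptions made for the sketch; see the linked repository for the actual method and hyperparameters.

```python
import numpy as np

def pome_edit(w_pre, w_ft, rank):
    """Hypothetical sketch of a POME-style edit on one weight matrix.

    w_pre: pretrained weight matrix
    w_ft:  fine-tuned weight matrix
    rank:  number of top singular directions to keep (assumed hyperparameter)
    """
    # Delta between the fine-tuned and pretrained checkpoints.
    delta = w_ft - w_pre

    # Truncated SVD: keep only the top-`rank` directions, pruning
    # small singular values that often represent noise.
    U, S, Vt = np.linalg.svd(delta, full_matrices=False)
    U_k, Vt_k = U[:, :rank], Vt[:rank, :]

    # Muon-style projection: set the kept singular values to a common
    # value, equalizing the influence of the dominant update directions.
    delta_proj = U_k @ Vt_k  # singular values are all 1 here

    # Rescale so the edited delta matches the original delta's Frobenius
    # norm (an assumption of this sketch, to preserve update magnitude).
    delta_proj *= np.linalg.norm(delta) / np.linalg.norm(delta_proj)

    return w_pre + delta_proj
```

In practice such an edit would be applied per weight matrix across the model's checkpoints after fine-tuning finishes, which is what makes it a zero-cost post-processing step.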

Yong Liu, Di Fu, Yang Luo, Zirui Zhu, Minhao Cheng, Cho-Jui Hsieh, Yang You• 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Instruction Following | IFEval | IFEval Accuracy | 31.6 | 625 |
| Multi-task Language Understanding | MMLU | Accuracy | 57.3 | 321 |
| Question Answering | TruthfulQA | Accuracy | 38.4 | 152 |
| Natural Language Inference | MNLI | -- | -- | 80 |
| Natural Language Inference | QNLI | Accuracy | 68.2 | 61 |
| Safety Alignment | WildJailbreak | Safe@1 | 51.6 | 24 |
| Language Modeling | MMLU | MMLU Final Performance | 46 | 23 |
| Question Answering | TruthfulQA | TruthfulQA | 29.2 | 22 |
| Safety Alignment | StrongREJECT | -- | -- | 18 |
| Text-to-Text | ESNLI (test) | Accuracy (ESNLI Test) | 80.6 | 6 |

Showing 10 of 28 rows.
