Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization

About

Foundation models, such as large language models (LLMs), are powerful but often require customization before deployment to satisfy practical constraints such as safety, privacy, and task-specific requirements, leading to "constrained" optimization problems for model steering and adaptation. However, solving such problems remains largely underexplored and is particularly challenging due to interference between the primary objective and constraint objectives during optimization. In this paper, we propose a subspace control framework for constrained model training. Specifically, (i) we first analyze, from a model merging perspective, how spectral cross-task interference arises and show that it can be resolved via a one-shot solution that orthogonalizes the merged subspace; (ii) we establish a connection between this solution and gradient orthogonalization in the spectral optimizer Muon; and (iii) building on these insights, we introduce SIFT (spectral interference-free training), which leverages a localization scheme to selectively intervene during optimization, enabling controllable updates that mitigate objective-constraint conflicts. We evaluate SIFT across four representative applications: (a) machine unlearning, (b) safety alignment, (c) text-to-speech adaptation, and (d) hallucination mitigation. Compared to both control-based and control-free baselines, SIFT consistently achieves substantial and robust performance improvements across all tasks. Code is available at https://github.com/OPTML-Group/SIFT.

Yancheng Huang, Changsheng Wang, Chongyu Fan, Yicheng Lang, Bingqi Shang, Yang Zhang, Mingyi Hong, Qing Qu, Alvaro Velasquez, Sijia Liu• 2026

Related benchmarks

Task	Dataset	Result
Instruction Following	IFEval	IFEval Accuracy32.7	836
Multi-task Language Understanding	MMLU	Accuracy56.8	353
Question Answering	TruthfulQA	Accuracy38.6	164
Natural Language Inference	MNLI	--	80
Natural Language Inference	QNLI	Accuracy68.2	78
Language Modeling	MMLU	MMLU Final Performance46.2	42
Language Understanding	MMLU	MMLU Score39.2	40
Question Answering	TruthfulQA	TruthfulQA29	37
Safety Alignment	WildJailbreak	Safe@156	24
Natural Language Inference	MNLI	MNLI Accuracy40.5	23

Showing 10 of 28 rows

Other info

Follow for update

@wizwand_team Discord