Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cross-Layer Subspace Coupling for LLM Compression: A Unifying Framework and Its Empirical Limits

About

Recent SVD based compression methods for large language models like SVD LLM and Basis Sharing can be unified under one optimization problem. While mathematical proofs and tests on Pythia models show this unified approach improves weight reconstruction error by up to 46% percent it fails in practical tasks. Downstream metrics like perplexity and accuracy severely degrade compared to standard per layer SVD LLM. The authors explain this failure mechanistically. Although the bundle method mathematically couples adjacent layers the transformer residual stream actually decouples them during forward passes. Thus per layer optimality matters more than joint cross layer optimization. The paper concludes that weight space reconstruction is a flawed objective for cross layer compression and future methods must focus on per layer activation reconstruction instead.

Snigdha Chandan Khilar• 2026

Related benchmarks

TaskDatasetResultRank
Common Sense ReasoningHellaSwag
Accuracy (acc_n)27.85
47
Frobenius reconstructionPythia attention output projections 70M
Reconstruction Error Fraction16.8
4
Frobenius reconstructionPythia attention output projections 1.4B
Dimensionless Reconstruction Error Fraction0.269
4
Activation ReconstructionPythia 1.4b--
4
Showing 4 of 4 rows

Other info

Follow for update