GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks

About

Structured deep model compression methods are hardware-friendly and substantially reduce memory and inference costs. However, under aggressive compression, the resulting accuracy degradation often necessitates post-compression finetuning, which can be impractical due to missing labeled data or high training cost. We propose post-hoc blockwise compensation, called GRAIL, a simple zero-finetuning step applied after model compression that restores each block's input-output behavior using a small calibration set. The method summarizes hidden activations via a Gram matrix and applies ridge regression to linearly reconstruct the original hidden representation from the reduced one. The resulting reconstruction map is absorbed into the downstream projection weights, while the upstream layer is compressed. The approach is selector-agnostic (Magnitude, Wanda, Gram-based selection, or folding), data-aware (requiring only a few forward passes without gradients or labels), and recovers classic pruning or folding when the Gram matrix is near identity, indicating weak inter-channel correlations. Across ResNets, ViTs, and decoder-only LLMs, GRAIL consistently improves accuracy or perplexity over data-free and data-aware pruning or folding baselines in practical compression regimes, with manageable overhead and no backpropagation. The code is available at https://github.com/TWWinde/GRAIL_Compensation.
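The compensation step described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: it covers only the channel-selection (pruning) case, the function name compensate_downstream, the relative scaling of the ridge term, and the toy data are all hypothetical, and the downstream layer is assumed to be a plain linear map applied as H @ W.

```python
import numpy as np

def compensate_downstream(gram, keep, w_down, ridge=1e-6):
    """Fold a ridge-regression reconstruction map into downstream weights.

    gram   : (d, d) Gram matrix H.T @ H of the original hidden activations
    keep   : indices of the k channels that survive compression
    w_down : (d, out) downstream projection, applied as H @ w_down
    ridge  : relative ridge strength (hypothetical default, not from the paper)
    """
    keep = np.asarray(keep)
    g_kk = gram[np.ix_(keep, keep)]  # (k, k) Gram of the kept channels
    g_kd = gram[keep, :]             # (k, d) kept channels vs. all channels
    # Assumption: scale the ridge term by the mean diagonal of g_kk.
    lam = ridge * np.trace(g_kk) / len(keep)
    # Ridge solution of min_B ||H - H[:, keep] @ B||^2 + lam * ||B||^2.
    recon = np.linalg.solve(g_kk + lam * np.eye(len(keep)), g_kd)  # (k, d)
    # Absorb the reconstruction map into the downstream projection.
    return recon @ w_down            # (k, out)

# Toy calibration data: 8 strongly correlated channels (rank 4), no labels.
rng = np.random.default_rng(0)
n, d, k, out = 256, 8, 4, 3
hidden = rng.standard_normal((n, k)) @ rng.standard_normal((k, d))
w_down = rng.standard_normal((d, out))
keep = np.array([0, 2, 5, 7])

target = hidden @ w_down
w_pruned = w_down[keep]  # classic pruning: drop rows, no compensation
w_comp = compensate_downstream(hidden.T @ hidden, keep, w_down)
err_pruned = np.linalg.norm(target - hidden[:, keep] @ w_pruned)
err_comp = np.linalg.norm(target - hidden[:, keep] @ w_comp)

# With an identity Gram matrix and no ridge, the map reduces to row selection.
w_identity = compensate_downstream(np.eye(d), keep, w_down, ridge=0.0)
```

In this toy setting the channels are highly correlated, so the compensated weights reproduce the original block output far more closely than plain row selection does. With an identity Gram matrix the result equals w_down[keep], which matches the abstract's remark that classic pruning is recovered when inter-channel correlations are weak.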

Wenwu Tang, Dong Wang, Lothar Thiele, Olga Saukh • 2026

Related benchmarks

Task	Dataset	Result
Language Modeling	WikiText2	Perplexity 5.75	2839
Commonsense Reasoning	HellaSwag	Accuracy 53.73	1891
Language Modeling	C4	Perplexity 8.05	1071
Language Modeling	PTB	Perplexity 21.63	1034
Question Answering	ARC Challenge	--	906
Question Answering	ARC Easy	Accuracy 73.82	597
Question Answering	PIQA	Accuracy 76.55	374
Reading Comprehension	BoolQ	Accuracy 74.46	279
Commonsense Reasoning	WinoGrande	Accuracy 68.43	68

