Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks

About

Structured deep model compression methods are hardware-friendly and substantially reduce memory and inference costs. However, under aggressive compression, the resulting accuracy degradation often necessitates post-compression finetuning, which can be impractical due to missing labeled data or high training cost. We propose post-hoc blockwise compensation, called GRAIL, a simple zero-finetuning step applied after model compression that restores each block's input-output behavior using a small calibration set. The method summarizes hidden activations via a Gram matrix and applies ridge regression to linearly reconstruct the original hidden representation from the reduced one. The resulting reconstruction map is absorbed into the downstream projection weights, while the upstream layer is compressed. The approach is selector-agnostic (Magnitude, Wanda, Gram-based selection, or folding), data-aware (requiring only a few forward passes without gradients or labels), and recovers classic pruning or folding when the Gram matrix is near identity, indicating weak inter-channel correlations. Across ResNets, ViTs, and decoder-only LLMs, GRAIL consistently improves accuracy or perplexity over data-free and data-aware pruning or folding baselines in practical compression regimes, with manageable overhead and no backpropagation. The code is available at https://github.com/TWWinde/GRAIL_Compensation.

Wenwu Tang, Dong Wang, Lothar Thiele, Olga Saukh• 2026

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText2
Perplexity5.75
2839
Commonsense ReasoningHellaSwag
Accuracy53.73
1891
Language ModelingC4
Perplexity8.05
1071
Language ModelingPTB
Perplexity21.63
1034
Question AnsweringARC Challenge--
906
Question AnsweringARC Easy
Accuracy73.82
597
Question AnsweringPIQA
Accuracy76.55
374
Reading ComprehensionBoolQ
Accuracy74.46
279
Commonsense ReasoningWinoGrande
Accuracy68.43
68
Showing 9 of 9 rows

Other info

Follow for update