Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
About
Adapting large pretrained models to diverse tasks is now routine, yet the two dominant strategies of parameter-efficient fine-tuning (PEFT) and low-rank compression are typically composed in sequence. This decoupled practice first compresses and then fine-tunes adapters, potentially misaligning the compressed subspace with downstream objectives and squandering a global parameter budget. To overcome this limitation, we introduce JACTUS (Joint Adaptation and Compression with a Task-aware Union of Subspaces), a single framework that unifies compression and adaptation. From a small calibration set, JACTUS estimates input and pre-activation gradient covariances, forms their orthogonal union with the pretrained weight subspace, performs a projected low-rank approximation inside this union, allocates rank globally by marginal gain per parameter, and trains only a compact core matrix. This explicitly mitigates the potential misalignment between the compressed subspace and downstream objectives by coupling the directions preserved for compression with those required for adaptation, yielding a deployable low-rank model that avoids retaining full frozen weights while enabling fast and robust tuning. On vision, JACTUS attains an average 89.2% accuracy on ViT-Base across eight datasets at 80% retained parameters, surpassing strong 100% PEFT baselines (e.g., DoRA 87.9%). On language, JACTUS achieves an 80.9% average on Llama2-7B commonsense QA at the same 80% retained-parameter budget, outperforming 100% PEFT (e.g., DoRA 79.7%) and exceeding prior compress-then-finetune pipelines under the same ratained-parameter budget. We will release code.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Mathematical Reasoning | GSM8K | Accuracy62.5 | 1398 | |
| Image Classification | DTD | -- | 599 | |
| Image Classification | RESISC45 | -- | 472 | |
| Image Classification | CIFAR-100 | -- | 204 | |
| Image Classification | FGVC Aircraft | Top-1 Acc71.5 | 112 | |
| Image Classification | Stanford Cars | Top-1 Accuracy87.8 | 98 | |
| Image Classification | EuroSAT | Top-1 Accuracy99.3 | 90 | |
| Image Classification | Oxford-IIIT Pet | Top-1 Accuracy95.2 | 75 | |
| Commonsense Question Answering | Commonsense QA | BoolQ Accuracy70.7 | 29 | |
| Image Classification | Vision Classification Suite (CIFAR-100, EuroSAT, RESISC45, Stanford Cars, FGVC Aircraft, DTD, CIFAR-10, Oxford-IIIT Pets) (test) | CIFAR-100 Accuracy95.2 | 20 |