CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection
About
Adapting CLIP to vertical domains is typically approached by novel fine-tuning strategies or by continual pre-training (CPT) on large domain-specific datasets. Yet, data itself remains an underexplored factor in this process. We revisit this task from a data-centric perspective: Can effective data selection substitute for large-scale datasets in CPT? We introduce CHIPS (Curvature-aware Hybrid Influence in Projection Subspace), which assigns each image-text pair a utility score that integrates three complementary factors aligned with three goals: faithfulness via a curvature-aware and Newton-style alignment computed in CLIP's end-point subspace; scalability via an InfoNCE-aware curvature estimator with Johnson-Lindenstrauss (JL) sketching; and retention via a selection-aware relevance weight combined with learnability to balance target adaptation against general-domain preservation. We justify this design theoretically by proving a lower-bound guarantee on the proxy's correlation with full-parameter alignment and by characterizing the bias-variance trade-offs introduced by curvature mixing and JL sketching. We evaluate CHIPS empirically across various settings: 1) CHIPS attains state-of-the-art performance among selection baselines on 17 medical benchmarks, matches full-dataset CPT with 30% of the data, and outperforms half-dataset CPT using only 10%; 2) on 31 general-domain benchmarks, CHIPS yields the least performance drop under all retention ratios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Image-Text Classification | Medical Specialties | Ophthalmology Performance44.13 | 30 | |
| Image Classification | General Domain 31 tasks | CLS Score47.88 | 30 | |
| Image-Text Retrieval | General Domain | Retrieval Score26.11 | 30 | |
| Medical Image Classification | Medical CLS | Accuracy35.48 | 21 | |
| Image Classification | General CLS | Accuracy58.24 | 21 | |
| Image-Text Retrieval | General RET | Recall32.09 | 21 | |
| Medical Image Classification | MedTrinity (test) | -- | 8 | |
| Medical Image Classification | MedTrinity v1.0 (test) | -- | 8 |