HAAF: Hierarchical Adaptation and Alignment of Foundation Models for Few-Shot Pathology Anomaly Detection
About
Precision pathology relies on detecting fine-grained morphological abnormalities within specific Regions of Interest (ROIs), as these local, texture-rich cues - rather than global slide contexts - drive expert diagnostic reasoning. While Vision-Language (V-L) models promise data efficiency by leveraging semantic priors, adapting them faces a critical Granularity Mismatch, where generic representations fail to resolve such subtle defects. Current adaptation methods often treat modalities as independent streams, failing to ground semantic prompts in ROI-specific visual contexts. To bridge this gap, we propose the Hierarchical Adaptation and Alignment Framework (HAAF). At its core is a novel Cross-Level Scaled Alignment (CLSA) mechanism that enforces a sequential calibration order: visual features first inject context into text prompts to generate content-adaptive descriptors, which then spatially guide the visual encoder to spotlight anomalies. Additionally, a dual-branch inference strategy integrates semantic scores with geometric prototypes to ensure stability in few-shot settings. Experiments on four benchmarks show HAAF significantly outperforms state-of-the-art methods and effectively scales with domain-specific backbones (e.g., CONCH) in low-resource scenarios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| WSI Classification | BRACS (test) | -- | 54 | |
| Image Classification | CRC (test) | Top-1 Acc89.55 | 34 | |
| Histopathology Anomaly Detection | Camelyon16 HIS | AUC91.97 | 17 | |
| Histopathology Anomaly Detection | SICAP v2 | AUC94.05 | 17 | |
| Histopathology Anomaly Detection | NCT-CRC | AUC0.9025 | 17 | |
| Histopathology Anomaly Detection | BRACS | AUC0.8353 | 17 | |
| Image Classification | HIS Camelyon16 (test) | AUC92.35 | 16 | |
| Image Classification | SICAP (test) | AUC97.7 | 16 |