HAAF: Hierarchical Adaptation and Alignment of Foundation Models for Few-Shot Pathology Anomaly Detection

About

Precision pathology relies on detecting fine-grained morphological abnormalities within specific Regions of Interest (ROIs), as these local, texture-rich cues - rather than global slide contexts - drive expert diagnostic reasoning. While Vision-Language (V-L) models promise data efficiency by leveraging semantic priors, adapting them faces a critical Granularity Mismatch, where generic representations fail to resolve such subtle defects. Current adaptation methods often treat modalities as independent streams, failing to ground semantic prompts in ROI-specific visual contexts. To bridge this gap, we propose the Hierarchical Adaptation and Alignment Framework (HAAF). At its core is a novel Cross-Level Scaled Alignment (CLSA) mechanism that enforces a sequential calibration order: visual features first inject context into text prompts to generate content-adaptive descriptors, which then spatially guide the visual encoder to spotlight anomalies. Additionally, a dual-branch inference strategy integrates semantic scores with geometric prototypes to ensure stability in few-shot settings. Experiments on four benchmarks show HAAF significantly outperforms state-of-the-art methods and effectively scales with domain-specific backbones (e.g., CONCH) in low-resource scenarios.

Chunze Yang, Wenjie Zhao, Yue Tang, Junbo Lu, Jiusong Ge, Qidong Liu, Zeyu Gao, Chen Li• 2026

Related benchmarks

Task	Dataset	Result
WSI Classification	BRACS (test)	--	54
Image Classification	CRC (test)	Top-1 Acc89.55	34
Histopathology Anomaly Detection	Camelyon16 HIS	AUC91.97	17
Histopathology Anomaly Detection	SICAP v2	AUC94.05	17
Histopathology Anomaly Detection	NCT-CRC	AUC0.9025	17
Histopathology Anomaly Detection	BRACS	AUC0.8353	17
Image Classification	HIS Camelyon16 (test)	AUC92.35	16
Image Classification	SICAP (test)	AUC97.7	16

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord