Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning
About
Traditional whole slide image (WSI) analysis methods typically rely on the multiple instance learning (MIL) paradigm, which extracts patch-level features at high magnification and aggregates them for slide-level prediction. However, such exhaustive patch-level processing is computationally expensive, severely limiting the efficiency and scalability of WSI analysis. To address this challenge, we propose PathCTM (a Pathology-oriented Continuous Thought Model) that enables token-efficient scale-space continuous reasoning for gigapixel WSIs. PathCTM formulates diagnostic inference as a dynamic sequential information pursuit. It progressively transitions from low-magnification global to high-magnification local inspection, and adaptively terminates inference when sufficient evidence is gathered to effectively bound decision uncertainty. Specifically, it uses conditional computation for dynamic scale switching with attention-guided region pruning, coupled with confidence-aware early stopping. Extensive experiments demonstrate that, compared with standard MIL-based methods, PathCTM reduces the number of required image patches by 95.95% and shortens inference time by approximately 95.62%, while maintaining AUC without degradation. Code is available at https://github.com/JSGe-AI/PathCTM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Diagnostic Classification | BRACS-7 | AUC0.889 | 86 | |
| Cancer Subtyping | BRACS-7 | AUC0.893 | 40 | |
| Breast cancer subtype classification | BRACS-3 | AUC93.6 | 22 | |
| Gene Mutation Prediction | MUT-SETD2 MUT-HET-RCC | AUC75.4 | 22 | |
| Renal cell carcinoma staging | RCC-STAGING TCGA | AUC83 | 22 | |
| Lesion-type classification | BRACS-3 | AUC93.1 | 5 | |
| Renal cell carcinoma staging | RCC-STAGING | AUC82.84 | 5 | |
| SETD2 mutation prediction | MUT-SETD2 | AUC73.94 | 5 |