AtlasPatch: Efficient Tissue Detection and High-throughput Patch Extraction for Computational Pathology at Scale
About
Whole-slide image (WSI) preprocessing, comprising tissue detection followed by patch extraction, is foundational to AI-driven computational pathology but remains a major bottleneck for scaling to large and heterogeneous cohorts. We present AtlasPatch, a scalable framework that couples foundation-model tissue detection with high-throughput patch extraction at minimal computational overhead. Our tissue detector achieves high precision (0.986) and remains robust across varying tissue conditions (e.g., brightness, fragmentation, boundary definition, tissue heterogeneity) and common artifacts (e.g., pen/ink markings, scanner streaks). This robustness is enabled by our annotated, heterogeneous multi-cohort training set of ~30,000 WSI thumbnails combined with efficient adaptation of the Segment-Anything (SAM) model. AtlasPatch also reduces end-to-end WSI preprocessing time by up to 16$\times$ versus widely used deep-learning pipelines, without degrading downstream task performance. The AtlasPatch tool is open-source, efficiently parallelized for practical deployment, and supports options to save extracted patches or stream them into common feature-extraction models for on-the-fly embedding, making it adaptable to both pathology departments (tissue detection and quality control) and AI researchers (dataset creation and model training). AtlasPatch software package is available at https://github.com/AtlasAnalyticsLab/AtlasPatch.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cancer Subtyping | TCGA-BRCA (test) | Accuracy93.7 | 12 | |
| Patch-Level Classification | Colorectal cancer (test) | Bal.ACC98 | 11 | |
| Lung cancer subtyping | TCGA LUAD vs LUSC (test) | Accuracy95.8 | 4 | |
| Renal Cell Carcinoma Subtyping | TCGA-KIRC vs KIRP (test) | Accuracy97.7 | 4 | |
| Dysplasia level classification | in-house dysplasia (test) | Accuracy96.6 | 4 | |
| Prostate cancer grading | PANDA (test) | Accuracy73.5 | 3 |