Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

No More Sliding Window: Efficient 3D Medical Image Segmentation with Differentiable Top-k Patch Sampling

About

3D models surpass 2D models in CT/MRI segmentation by effectively capturing inter-slice relationships. However, the added depth dimension substantially increases memory consumption. While patch-based training alleviates memory constraints, it significantly slows down the inference speed due to the sliding window (SW) approach. We propose No-More-Sliding-Window (NMSW), a novel end-to-end trainable framework that enhances the efficiency of generic 3D segmentation backbone during an inference step by eliminating the need for SW. NMSW employs a differentiable Top-k module to selectively sample only the most relevant patches, thereby minimizing redundant computations. When patch-level predictions are insufficient, the framework intelligently leverages coarse global predictions to refine results. Evaluated across 3 tasks using 3 segmentation backbones, NMSW achieves competitive accuracy compared to SW inference while significantly reducing computational complexity by 91% (88.0 to 8.00 TMACs). Moreover, it delivers a 9.1x faster inference on the H100 GPU (99.0 to 8.3 sec) and a 11.1x faster inference on the Xeon Gold CPU (2110 to 189 sec). NMSW is model-agnostic, further boosting efficiency when integrated with any existing efficient segmentation backbones. The code is avaialble: https://github.com/Youngseok0001/open_nmsw.

Young Seok Jeon, Hongfei Yang, Huazhu Fu, Mengling Feng• 2025

Related benchmarks

TaskDatasetResultRank
3D Medical Image SegmentationWORD 150 CT scans (test)
DSC86.3
11
3D Medical Image SegmentationTotalSegmentator Organ 1,204 CT scans (test)
DSC0.895
4
3D Medical Image SegmentationTotalSegmentator Vertebrae (test)
DSC90.9
4
Showing 3 of 3 rows

Other info

Code

Follow for update