Exploiting DINOv3-Based Self-Supervised Features for Robust Few-Shot Medical Image Segmentation
About
Deep learning-based automatic medical image segmentation plays a critical role in clinical diagnosis and treatment planning but remains challenging in few-shot scenarios due to the scarcity of annotated training data. Recently, self-supervised foundation models such as DINOv3, which were trained on large natural image datasets, have shown strong potential for dense feature extraction that can help with the few-shot learning challenge. Yet, their direct application to medical images is hindered by domain differences. In this work, we propose DINO-AugSeg, a novel framework that leverages DINOv3 features to address the few-shot medical image segmentation challenge. Specifically, we introduce WT-Aug, a wavelet-based feature-level augmentation module that enriches the diversity of DINOv3-extracted features by perturbing frequency components, and CG-Fuse, a contextual information-guided fusion module that exploits cross-attention to integrate semantic-rich low-resolution features with spatially detailed high-resolution features. Extensive experiments on six public benchmarks spanning five imaging modalities, including MRI, CT, ultrasound, endoscopy, and dermoscopy, demonstrate that DINO-AugSeg consistently outperforms existing methods under limited-sample conditions. The results highlight the effectiveness of incorporating wavelet-domain augmentation and contextual fusion for robust feature representation, suggesting DINO-AugSeg as a promising direction for advancing few-shot medical image segmentation. Code and data will be made available on https://github.com/apple1986/DINO-AugSeg.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Image Segmentation | Synapse | -- | 52 | |
| Medical Image Segmentation | TN3K 25 samples (train) | DICE Score60.13 | 16 | |
| Medical Image Segmentation | Kvasir-SEG 10 samples (train) | Dice Score (%)73.59 | 16 | |
| Medical Image Segmentation | Kvasir-SEG 40 samples (train) | DICE78.61 | 16 | |
| Medical Image Segmentation | ACDC seven-shot | DICE81.85 | 16 | |
| Medical Image Segmentation | LA seven-shot 2018 | DICE86.22 | 16 | |
| Medical Image Segmentation | Synapse seven-shot | DICE71.19 | 16 | |
| Medical Image Segmentation | TN3K 100 samples (train) | DICE65.4 | 16 | |
| Medical Image Segmentation | ISIC 5 samples 2018 (train) | DICE (%)67.64 | 16 | |
| Medical Image Segmentation | ISIC 25 samples 2018 (train) | Dice Score78.53 | 16 |