Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

No time to train! Training-Free Reference-Based Instance Segmentation

About

The performance of image segmentation models has historically been constrained by the high cost of collecting large-scale annotated data. The Segment Anything Model (SAM) alleviates this original problem through a promptable, semantics-agnostic, segmentation paradigm and yet still requires manual visual-prompts or complex domain-dependent prompt-generation rules to process a new image. Towards reducing this new burden, our work investigates the task of object segmentation when provided with, alternatively, only a small set of reference images. Our key insight is to leverage strong semantic priors, as learned by foundation models, to identify corresponding regions between a reference and a target image. We find that correspondences enable automatic generation of instance-level segmentation masks for downstream tasks and instantiate our ideas via a multi-stage, training-free method incorporating (1) memory bank construction; (2) representation aggregation and (3) semantic-aware feature matching. Our experiments show significant improvements on segmentation metrics, leading to state-of-the-art performance on COCO FSOD (36.8% nAP), PASCAL VOC Few-Shot (71.2% nAP50) and outperforming existing training-free approaches on the Cross-Domain FSOD benchmark (22.4% nAP).

Miguel Espinosa, Chenhongyi Yang, Linus Ericsson, Steven McDonagh, Elliot J. Crowley• 2025

Related benchmarks

TaskDatasetResultRank
Few-shot Object DetectionCD-FSOD
ArTaxOr Score35.7
152
Few-shot Semantic SegmentationCOCO-20i (val)
mIoU Mean0.527
78
Object DetectionPascal-5i 2010 (Novel Split 1)
nAP5070.8
54
Object DetectionCOCO-FSOD 30-shot COCO-20
nAP36.8
47
Few-shot Object DetectionMS-COCO 10-shot (novel classes)
nAP36.6
34
Few-shot Object DetectionMS-COCO 30-shot (novel classes)
nAP (Novel)36.8
34
Object DetectionPascal-5i 2010 (Novel Split 3)
nAP5072.6
19
Few-shot Object DetectionPascal VOC (Novel Split 1)
AP5079.1
16
Object DetectionCOCO-20i 10-shot
nAP36.6
16
Object DetectionCOCO-FSOD 10-shot 20
nAP36.6
14
Showing 10 of 20 rows

Other info

Code

Follow for update