Weakly Supervised Camouflaged Object Detection Based on the SAM Model and Mask Guidance
About
Camouflaged object detection (COD) from a single image is a challenging task due to the high similarity between objects and their surroundings. Existing fully supervised methods require labor-intensive pixel-level annotations, making weakly supervised methods a viable compromise that balances accuracy and annotation efficiency. However, weakly supervised methods often experience performance degradation due to the use of coarse annotations. In this paper, we introduce a new weakly supervised approach for camouflaged object detection to overcome these limitations. Specifically, we propose a novel network, MGNet, which tackles edge ambiguity and missed detections by utilizing initial masks generated by our custom-designed Cascaded Mask Decoder (CMD) to guide the segmentation process and enhance edge predictions. We introduce a Context Enhancement Module(CEM) to reduce the missing detection, and a Mask-guided Feature Aggregation Module (MFAM) for effective feature aggregation. For the weak supervision challenge, we propose BoxSAM, which leverages the Segment Anything Model (SAM) with bounding-box prompts to generate pseudo-labels. By employing a redundant processing strategy, high quality pixel-level pseudo-labels are provided for training MGNet. Extensive experiments demonstrate that our method delivers competitive performance against current state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Salient Object Detection | DUTS (test) | M (MAE)0.033 | 357 | |
| Camouflaged Object Detection | COD10K (test) | S-measure (S_alpha)0.883 | 306 | |
| Salient Object Detection | ECSSD | MAE0.029 | 226 | |
| Salient Object Detection | HKU-IS | MAE0.029 | 179 | |
| Camouflaged Object Detection | CAMO (test) | M0.102 | 154 | |
| Salient Object Detection | DUT-OMRON | MAE0.048 | 137 | |
| Polyp Segmentation | CVC-ColonDB (test) | Mean Dice0.807 | 68 | |
| Camouflaged Object Detection | NC4K | S_alpha0.896 | 33 | |
| Defect Detection | CDS2K (20% test) | S_alpha Score87.7 | 4 |