Segment Anything with Robust Uncertainty-Accuracy Correlation
About
Despite strong zero-shot performance, SAM is unreliable under domain shift due to Mask-level Confidence Confusion (MCC), where a single IoU-based mask score fails to reflect pixel-wise reliability near boundaries. Motivated by the contrast between texture-biased shortcuts in neural networks and shape-centric processing in human vision, we model out-of-domain variation as appearance shifts and non-rigid deformations that jointly stress calibration. We propose Segment Anything with Robust Uncertainty-Accuracy Correlation (RUAC) for robust pixel-wise uncertainty estimation under appearance and deformation shifts. RUAC adds a lightweight uncertainty head, trains it with a collaborative style-deformation attack that jointly perturbs texture and geometry, and applies Uncertainty-Accuracy Alignment to ensure uncertainty consistently highlights erroneous pixels even under adversarial perturbations. Across 23 zero-shot domains, RUAC improves segmentation quality and yields more faithful uncertainty with stronger uncertainty-accuracy correlation. Project page: https://hongyouzhou.github.io/ruac/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Segmentation | TrashCan | J&F Score75 | 20 | |
| Object Segmentation | iShape | J&F Score86.1 | 6 | |
| Object Segmentation | ZeroWaste-f | J&F Score89.1 | 4 | |
| Object Segmentation | LVIS | J&F Score78.6 | 2 | |
| Object Segmentation | Cityscapes | J&F Score66.7 | 2 | |
| Object Segmentation | VISOR | J&F Score76.6 | 2 | |
| Object Segmentation | OVIS | J&F Score86.8 | 2 | |
| Segment Anything | TrashCan | PAvPU70.5 | 2 | |
| Segment Anything | iShape | PAvPU78.3 | 2 | |
| Segment Anything | ZeroWaste | PAvPU83 | 2 |