Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

About

When deploying segmentation models in practice, it is critical to evaluate their behaviors in varied and complex scenes. Different from the previous evaluation paradigms only in consideration of global attribute variations (e.g. adverse weather), we investigate both local and global attribute variations for robustness evaluation. To achieve this, we construct a mask-preserved attribute editing pipeline to edit visual attributes of real images with precise control of structural information. Therefore, the original segmentation labels can be reused for the edited images. Using our pipeline, we construct a benchmark covering both object and image attributes (e.g. color, material, pattern, style). We evaluate a broad variety of semantic segmentation models, spanning from conventional close-set models to recent open-vocabulary large models on their robustness to different types of variations. We find that both local and global attribute variations affect segmentation performances, and the sensitivity of models diverges across different variation types. We argue that local attributes have the same importance as global attributes, and should be considered in the robustness evaluation of segmentation models. Code: https://github.com/PRIS-CV/Pascal-EA.

Zijin Yin, Kongming Liang, Bing Li, Zhanyu Ma, Jun Guo• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationACDC Snow (test)--
20
Semantic segmentationPascal VOC Wood (test)--
15
Semantic segmentationPascal-EA 1.0 (val)--
11
Color EditingPascal VOC 16
DINO Dist0.001
7
Material EditingPascal VOC 16
DINO Distance0.003
7
Weather TransferRain condition images (test)
CLIP Accuracy98.9
4
Weather TransferFog condition images (test)
CLIP Accuracy99.9
4
Weather TransferNight condition images (test)
CLIP Accuracy96.5
4
Weather TransferSnow condition images (test)
CLIP Acc100
3
Showing 9 of 9 rows

Other info

Code

Follow for update