Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AnyUp: Universal Feature Upsampling

About

We introduce AnyUp, a method for feature upsampling that can be applied to any vision feature at any resolution, without encoder-specific training. Existing learning-based upsamplers for features like DINO or CLIP need to be re-trained for every feature extractor and thus do not generalize to different feature types at inference time. In this work, we propose an inference-time feature-agnostic upsampling architecture to alleviate this limitation and improve upsampling quality. In our experiments, AnyUp sets a new state of the art for upsampled features, generalizes to different feature types, and preserves feature semantics while being efficient and easy to apply to a wide range of downstream tasks.

Thomas Wimmer, Prune Truong, Marie-Julie Rakotosaona, Michael Oechsle, Federico Tombari, Bernt Schiele, Jan Eric Lenssen• 2025

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU42.25
3069
Semantic segmentationADE20K
mIoU42.26
559
Semantic segmentationCOCO Stuff
mIoU62.16
399
Semantic segmentationPASCAL VOC (val)
mIoU84.33
380
Semantic segmentationPascal VOC
mIoU0.84
280
Monocular Depth EstimationNYU V2
Delta 1 Acc92.33
174
Semantic segmentationCOCO Stuff (val)
mIoU62.08
167
Depth EstimationNYU V2--
167
Semantic segmentationPascal VOC
mIoU83.85
159
Video Object SegmentationDAVIS
J & F Mean72.44
128
Showing 10 of 21 rows

Other info

Follow for update