Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AnyUp: Universal Feature Upsampling

About

We introduce AnyUp, a method for feature upsampling that can be applied to any vision feature at any resolution, without encoder-specific training. Existing learning-based upsamplers for features like DINO or CLIP need to be re-trained for every feature extractor and thus do not generalize to different feature types at inference time. In this work, we propose an inference-time feature-agnostic upsampling architecture to alleviate this limitation and improve upsampling quality. In our experiments, AnyUp sets a new state of the art for upsampled features, generalizes to different feature types, and preserves feature semantics while being efficient and easy to apply to a wide range of downstream tasks.

Thomas Wimmer, Prune Truong, Marie-Julie Rakotosaona, Michael Oechsle, Federico Tombari, Bernt Schiele, Jan Eric Lenssen• 2025

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU42.25
2888
Semantic segmentationCOCO Stuff
mIoU62.16
379
Semantic segmentationADE20K
mIoU42.26
366
Semantic segmentationPASCAL VOC (val)
mIoU84.33
362
Semantic segmentationPascal VOC
mIoU0.84
180
Monocular Depth EstimationNYU V2
Delta 1 Acc92.33
131
Semantic segmentationPascal VOC
mIoU83.85
129
Semantic segmentationCOCO Stuff (val)
mIoU62.08
126
Semantic segmentationCOCO
mIoU61.1
103
Depth EstimationNYU V2--
57
Showing 10 of 17 rows

Other info

Follow for update