
Implicit Neural Representation Facilitates Unified Universal Vision Encoding

About

Models for image representation learning are typically designed for either recognition or generation. Various forms of contrastive learning help models learn to convert images to embeddings that are useful for classification, detection, and segmentation. On the other hand, models can be trained to reconstruct images with pixel-wise, perceptual, and adversarial losses in order to learn a latent space that is useful for image generation. We seek to unify these two directions with a first-of-its-kind model that learns representations which are simultaneously useful for recognition and generation. We train our model as a hyper-network for implicit neural representation, which learns to map images to model weights for fast, accurate reconstruction. We further integrate our INR hyper-network with knowledge distillation to improve its generalization and performance. Beyond the novel training design, the model also learns an unprecedented compressed embedding space with outstanding performance for various visual tasks. The complete model competes with state-of-the-art results for image representation learning, while also enabling generative capabilities with its high-quality tiny embeddings. The code is available at https://github.com/tiktok/huvr.
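The hyper-network idea in the abstract (an encoder maps an image to a compact embedding, and a head maps that embedding to the weights of a small coordinate network that reconstructs the image) can be illustrated with a toy, untrained sketch. This is not the authors' architecture or code: the class name ToyHyperINR, the layer sizes, the sine activation, and the random weights are all assumptions chosen only to show the data flow.

```python
import math
import random


def random_matrix(rows, cols, rng, scale):
    """Return a rows x cols matrix of Gaussian entries (untrained, illustration only)."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


class ToyHyperINR:
    """Hypothetical toy model: image -> embedding -> weights of a coordinate MLP."""

    def __init__(self, num_pixels, embed_dim=8, hidden=4, seed=0):
        rng = random.Random(seed)
        self.hidden = hidden
        # Encoder (assumed linear here): flattened RGB image -> compact embedding.
        self.encoder = random_matrix(embed_dim, num_pixels * 3, rng, 0.1)
        # Hyper-network head: embedding -> every INR parameter, flattened.
        # Layout: W1 (hidden x 2), b1 (hidden), W2 (3 x hidden), b2 (3).
        self.num_params = hidden * 2 + hidden + 3 * hidden + 3
        self.head = random_matrix(self.num_params, embed_dim, rng, 0.5)

    def embed(self, image):
        """Map a list of (r, g, b) pixels to the embedding vector."""
        flat = [channel for pixel in image for channel in pixel]
        return [math.tanh(v) for v in matvec(self.encoder, flat)]

    def inr_weights(self, embedding):
        """Predict the coordinate MLP's parameters from the embedding."""
        flat = matvec(self.head, embedding)
        h = self.hidden
        w1 = [flat[i * 2:(i + 1) * 2] for i in range(h)]
        b1 = flat[2 * h:3 * h]
        off = 3 * h
        w2 = [flat[off + i * h:off + (i + 1) * h] for i in range(3)]
        b2 = flat[off + 3 * h:]
        return w1, b1, w2, b2

    def render(self, weights, x, y):
        """Evaluate the predicted INR at coordinate (x, y), returning RGB in (0, 1)."""
        w1, b1, w2, b2 = weights
        hidden = [math.sin(a + b) for a, b in zip(matvec(w1, [x, y]), b1)]
        logits = [a + b for a, b in zip(matvec(w2, hidden), b2)]
        return [1.0 / (1.0 + math.exp(-z)) for z in logits]


# A 2x2 image stored as four (r, g, b) pixels with values in [0, 1].
image = [(0.1, 0.2, 0.3), (0.9, 0.8, 0.7), (0.5, 0.5, 0.5), (0.0, 1.0, 0.0)]
model = ToyHyperINR(num_pixels=len(image))
embedding = model.embed(image)          # compact representation of the image
weights = model.inr_weights(embedding)  # per-image INR parameters
rgb = model.render(weights, 0.25, 0.75) # color predicted at one coordinate
```

In a trained system of this kind, the encoder and head would be optimized so that rendering the predicted network at every pixel coordinate reproduces the input image; the embedding is then the quantity that could be reused for recognition or generation. The sketch above only fixes the shapes and the order of operations.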

Matthew Gwilliam, Xiao Wang, Xuefeng Hu, Zhenheng Yang • 2026

Related benchmarks

Task                                Dataset             Result               Rank
Semantic segmentation               ADE20K (val)        mIoU 53.5            2731
Image Classification                ImageNet-1K         --                   524
Image Classification                Food-101            Accuracy 91.2        494
Image Classification                Stanford Cars       Accuracy 88.1        477
Image Classification                ImageNet-ReaL       Precision@1 85.6     195
Image Classification                ObjectNet           --                   177
Image Reconstruction                ImageNet1K (val)    --                   83
Image Classification                Oxford 102 Flowers  Top-1 Accuracy 99.6  68
Class-conditional Image Generation  ImageNet-1k (val)   FID 24.53            68
Depth Estimation                    NYU v2 (val)        RMSE 0.3263          53
Showing 10 of 15 rows
