Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AdaptSplat: Adapting Vision Foundation Models for Feed-Forward 3D Gaussian Splatting

About

This work explores a simple yet powerful lightweight adapter design for feed-forward 3D Gaussian Splatting (3DGS). Existing methods typically apply complex, architecture-specific designs on top of the generic pipeline of image feature extraction $\rightarrow$ multi-view interaction $\rightarrow$ feature decoding. However, constrained by the scale bottleneck of 3D training data and the low-pass filtering effect of deep networks, these methods still fall short in cross-domain generalization and high-frequency geometric fidelity. To address these problems, we propose AdaptSplat, which demonstrates that without complex component engineering, introducing a single adapter of only 1.5M parameters into the generic architecture is sufficient to achieve superior performance. Specifically, we design a lightweight Frequency-Preserving Adapter (FPA) that extracts direction-aware high-frequency structural priors from the shallow features of a powerful vision foundation model backbone, and seamlessly integrates them into the generic pipeline via high-frequency positional encodings and adaptive residual modulation. This effectively compensates for the high-frequency attenuation caused by over-smoothing in deep features, improving the fitting accuracy of Gaussian primitives on complex surfaces and sharp boundaries. Extensive experiments demonstrate that AdaptSplat achieves state-of-the-art feed-forward reconstruction performance on multiple standard benchmarks, with stable generalization across domains. Code available at: https://github.com/xmw666/AdaptSplat.

Mingwei Xing, Xinliang Wang, Yifeng Shi• 2026

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisMip-NeRF360
PSNR25.6
184
Novel View SynthesisRE10K
SSIM95.6
161
Novel View SynthesisDL3DV
PSNR29.27
75
Novel View SynthesisDL3DV 6view
PSNR25.795
34
Novel View SynthesisDL3DV 12 views
PSNR26.741
29
Novel View SynthesisDL3DV 24 views
PSNR26.901
28
Whole-House ReconstructionRealSee3D (held-out)
PSNR21.5156
8
Novel View SynthesisRE10K
PSNR33.86
5
Showing 8 of 8 rows

Other info

Follow for update