BridgeShape: Latent Diffusion Schr\"odinger Bridge for 3D Shape Completion

About

Existing diffusion-based 3D shape completion methods typically use a conditional paradigm, injecting incomplete shape information into the denoising network via deep feature interactions (e.g., concatenation, cross-attention) to guide sampling toward complete shapes, often represented by voxel-based distance functions. However, these approaches fail to explicitly model the optimal global transport path, leading to suboptimal completions. Moreover, performing diffusion directly in voxel space imposes resolution constraints, limiting the generation of fine-grained geometric details. To address these challenges, we propose BridgeShape, a novel framework for 3D shape completion via latent diffusion Schr\"odinger bridge. The key innovations lie in two aspects: (i) BridgeShape formulates shape completion as an optimal transport problem, explicitly modeling the transition between incomplete and complete shapes to ensure a globally coherent transformation. (ii) We introduce a Depth-Enhanced Vector Quantized Variational Autoencoder (VQ-VAE) to encode 3D shapes into a compact latent space, leveraging self-projected multi-view depth information enriched with strong DINOv2 features to enhance geometric structural perception. By operating in a compact yet structurally informative latent space, BridgeShape effectively mitigates resolution constraints and enables more efficient and high-fidelity 3D shape completion. BridgeShape achieves state-of-the-art performance on large-scale 3D shape completion benchmarks, demonstrating superior fidelity at higher resolutions and for unseen object classes.

Dequan Kong, Honghua Chen, Zhe Zhu, Mingqiang Wei• 2025

Related benchmarks

Task	Dataset	Result
3D Shape Completion	ShapeNet synthetic objects (unseen categories)	Average CD4.06	71
3D Shape Completion	ScanNet real-world objects, unseen categories Scan2CAD (test)	Average CD6.99	57
Point Cloud Completion	3D-EPN chair class	CD (l1, x10^3)0.0156	2

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord