
Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects

About

We propose a generative technique to edit 3D shapes, represented as meshes, NeRFs, or Gaussian Splats, in approximately 3 seconds, without running an SDS-type (Score Distillation Sampling) optimization. Our key insight is to cast 3D editing as a multiview image inpainting problem, as this representation is generic and can be mapped back to any 3D representation using the bank of available Large Reconstruction Models. We explore different fine-tuning strategies to obtain both multiview generation and inpainting capabilities within the same diffusion model. In particular, the design of the inpainting mask is an important factor in training an inpainting model, and we propose several masking strategies to mimic the types of edits a user would perform on a 3D shape. Our approach takes 3D generative editing from hours to seconds and produces higher-quality results than previous works.
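
To make the idea of a multiview inpainting mask concrete, the sketch below projects a single 3D edit region into several views, so that every view masks the same part of the shape. This is an illustration only and not the masking strategy from the paper: the spherical region, the orthographic cameras orbiting the vertical axis, and all function names (`view_axes`, `project`, `sphere_mask`, `multiview_masks`) are assumptions made for this example.

```python
import math


def view_axes(azimuth_deg):
    """Right/up vectors of a hypothetical orthographic camera orbiting the y axis.

    At azimuth 0 the camera sits on +z and looks toward the origin.
    """
    a = math.radians(azimuth_deg)
    right = (math.cos(a), 0.0, -math.sin(a))
    up = (0.0, 1.0, 0.0)
    return right, up


def project(point, azimuth_deg):
    """Orthographic projection of a 3D point onto the image plane of one view."""
    right, up = view_axes(azimuth_deg)
    u = sum(p * r for p, r in zip(point, right))
    v = sum(p * q for p, q in zip(point, up))
    return u, v


def sphere_mask(center, radius, azimuth_deg, size=32, extent=1.0):
    """Binary mask (list of rows) of a 3D sphere as seen from one view.

    The image plane covers [-extent, extent] in both directions; row 0 is the top.
    Under orthographic projection a sphere maps to a disc of the same radius.
    """
    cu, cv = project(center, azimuth_deg)
    step = 2.0 * extent / size
    mask = []
    for row in range(size):
        v = extent - (row + 0.5) * step  # pixel-center height
        line = []
        for col in range(size):
            u = -extent + (col + 0.5) * step  # pixel-center horizontal offset
            inside = (u - cu) ** 2 + (v - cv) ** 2 <= radius ** 2
            line.append(1 if inside else 0)
        mask.append(line)
    return mask


def multiview_masks(center, radius, azimuths=(0, 90, 180, 270), size=32):
    """One mask per view, all derived from the same 3D edit region."""
    return [sphere_mask(center, radius, a, size) for a in azimuths]


# Example: an edit region on the +x side of the object, seen from four views.
masks = multiview_masks(center=(0.5, 0.0, 0.0), radius=0.3)
```

Because each mask comes from the same 3D region, the masked area shifts across views the way the underlying shape does: here it appears on the right in the front view, in the centre of the side view, and on the left in the back view. A multiview inpainting model would then fill the masked pixels in all views jointly.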

Amir Barda, Matheus Gadelha, Vladimir G. Kim, Noam Aigerman, Amit H. Bermano, Thibault Groueix · 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| 3D Editing | 3D Editing | Time (s) | 25 | 11 |
| Multiview text-to-image inpainting | Multiview Inpainting Benchmark 500 images (test) | CLIP Similarity (L) | 29.01 | 7 |
| 3D Editing | 3D Editing Benchmark 100 assets 1.0 (test) | CLIP-T Score | 28.5 | 6 |
| 3D Mesh Editing | Edit3D-Bench 300 samples | CD | 0.124 | 6 |
| Text-guided 3D Editing | 57 3D assets (Trellis, GSO, and PartObjaverse-Tiny) 1.0 (test) | CLIP-T | 0.227 | 5 |
