Editing Conditional Radiance Fields
About
A neural radiance field (NeRF) is a scene model supporting high-quality view synthesis, optimized per scene. In this paper, we explore enabling user editing of a category-level NeRF - also known as a conditional radiance field - trained on a shape category. Specifically, we introduce a method for propagating coarse 2D user scribbles to the 3D space, to modify the color or shape of a local region. First, we propose a conditional radiance field that incorporates new modular network components, including a shape branch that is shared across object instances. Observing multiple instances of the same category, our model learns underlying part semantics without any supervision, thereby allowing the propagation of coarse 2D user scribbles to the entire 3D region (e.g., chair seat). Next, we propose a hybrid network update strategy that targets specific network components, which balances efficiency and accuracy. During user interaction, we formulate an optimization problem that both satisfies the user's constraints and preserves the original object structure. We demonstrate our approach on various editing tasks over three shape datasets and show that it outperforms prior neural editing approaches. Finally, we edit the appearance and shape of a real photograph and show that the edit propagates to extrapolated novel views.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | DTU (test) | PSNR20.15 | 101 | |
| Novel View Synthesis | NeRF Synthetic (test) | PSNR15.31 | 46 | |
| Color Editing | PhotoShapes | PSNR35.25 | 6 | |
| Color Editing | PhotoShapes (test) | PSNR35.25 | 5 | |
| 3D Scene Editing | NeRF Syn. | Editing Time28 | 5 | |
| 3D Scene Editing | PhotoShape | Editing Time19 | 5 | |
| 3D Scene Editing | Phy. Syn. | Editing Time37 | 5 | |
| 3D Scene Editing | DTU | Editing Time76 | 5 | |
| Appearance Editing | Phy. Syn. | FID270 | 5 | |
| Appearance Editing | NeRF Syn. | FID196 | 5 |