Learned Initializations for Optimizing Coordinate-Based Neural Representations
About
Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized weights for each new signal is inefficient. We propose applying standard meta-learning algorithms to learn the initial weight parameters for these fully-connected networks based on the underlying class of signals being represented (e.g., images of faces or 3D models of chairs). Despite requiring only a minor change in implementation, using these learned initial weights enables faster convergence during optimization and can serve as a strong prior over the signal class being modeled, resulting in better generalization when only partial observations of a given signal are available. We explore these benefits across a variety of tasks, including representing 2D images, reconstructing CT scans, and recovering 3D shapes and scenes from 2D image observations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Reconstruction | ImageNet 256x256 | -- | 93 | |
| Novel View Synthesis | DTU 6-view | PSNR18.8 | 49 | |
| Novel View Synthesis | DTU 3-view | PSNR18.2 | 47 | |
| Novel View Synthesis | DTU 9-view | PSNR20.2 | 22 | |
| Novel View Synthesis | ShapeNet cars category | PSNR22.8 | 20 | |
| View Synthesis | Redwood-3dscan (test) | PSNR15.1 | 19 | |
| Depth Estimation | Redwood-3dscan (test) | Depth Error Rate20.84 | 15 | |
| Image Reconstruction | CelebA (test) | -- | 15 | |
| View Synthesis | NeRF Real 5-view | PSNR13.8 | 13 | |
| View Synthesis | NeRF Real 10-view | PSNR14.3 | 13 |