Learned Initializations for Optimizing Coordinate-Based Neural Representations

About

Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized weights for each new signal is inefficient. We propose applying standard meta-learning algorithms to learn the initial weight parameters for these fully-connected networks based on the underlying class of signals being represented (e.g., images of faces or 3D models of chairs). Despite requiring only a minor change in implementation, using these learned initial weights enables faster convergence during optimization and can serve as a strong prior over the signal class being modeled, resulting in better generalization when only partial observations of a given signal are available. We explore these benefits across a variety of tasks, including representing 2D images, reconstructing CT scans, and recovering 3D shapes and scenes from 2D image observations.

Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng• 2020

Related benchmarks

Task	Dataset	Result
Image Reconstruction	ImageNet 256x256	--	202
Novel View Synthesis	DTU 3-view	PSNR18.2	112
Novel View Synthesis	DTU 6-view	PSNR18.8	87
Novel View Synthesis	DTU 9-view	PSNR20.2	60
Novel View Synthesis	ShapeNet cars category	PSNR22.8	20
View Synthesis	Redwood-3dscan (test)	PSNR15.1	19
Image fitting	CelebA-HQ (test)	PSNR53.1	18
Image fitting	AFHQ (test)	PSNR53.3	18
Image Reconstruction	CelebA (test)	--	17
Depth Estimation	Redwood-3dscan (test)	Depth Error Rate20.84	15

Showing 10 of 47 rows

Other info

Follow for update

@wizwand_team Discord