Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Spatially Collaged Fourier Bases for Implicit Neural Representation

About

Existing approaches to Implicit Neural Representation (INR) can be interpreted as a global scene representation via a linear combination of Fourier bases of different frequencies. However, such universal basis functions can limit the representation capability in local regions where a specific component is unnecessary, resulting in unpleasant artifacts. To this end, we introduce a learnable spatial mask that effectively dispatches distinct Fourier bases into respective regions. This translates into collaging Fourier patches, thus enabling an accurate representation of complex signals. Comprehensive experiments demonstrate the superior reconstruction quality of the proposed approach over existing baselines across various INR tasks, including image fitting, video representation, and 3D shape representation. Our method outperforms all other baselines, improving the image fitting PSNR by over 3dB and 3D reconstruction to 98.81 IoU and 0.0011 Chamfer Distance.

Jason Chun Lok Li, Chang Liu, Binxiao Huang, Ngai Wong• 2023

Related benchmarks

TaskDatasetResultRank
2D Image FittingKodak
PSNR40.29
8
2D Image FittingDIV2K D2K0-D2K7
D2K0 Score35.32
7
Image fittingDIV2K
PSNR (0873)26.1
6
3D Shape RepresentationStanford 3D Scanning Repository
IoU (Thai statue)98.75
6
Showing 4 of 4 rows

Other info

Follow for update