Disentangled Learning Improves Implicit Neural Representations for Medical Reconstruction
About
Implicit neural representations (INRs) have emerged as a powerful paradigm for medical imaging via physics-informed unsupervised learning. Classical INRs optimize an entire network from scratch for each subject, leading to inefficient training and suboptimal imaging quality. Recent initialization-based approaches attempt to inject population priors into pre-trained networks, yet they rely on high-quality images and often suffer from catastrophic forgetting during fine-tuning. We present DisINR, a novel INR framework that explicitly disentangles shared and subject-specific representations. DisINR introduces a shared encoder-decoder pair and subject-specific encoders, whose features are jointly decoded for image reconstruction. By integrating differentiable forward models, it pre-trains the shared modules directly from limited raw measurements, removing the need for pre-acquired high-quality images. During test-time adaptation, only the subject-specific encoder is optimized, while the shared pair remains frozen, effectively preserving learned priors. Extensive evaluations on three representative medical imaging tasks show that DisINR significantly outperforms state-of-the-art INRs in both reconstruction accuracy and efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sparse-view CT | COVID-19 dataset (unseen) | PSNR33.72 | 12 | |
| Sparse-View CT Reconstruction | DeepLesion (in-domain) | PSNR40.3 | 12 | |
| Sparse-View CT Reconstruction | LIDC (out-of-domain) | PSNR38.85 | 12 | |
| Undersampled MRI Reconstruction | fastMRI T2w (in-domain) | PSNR48.42 | 12 | |
| Undersampled MRI Reconstruction | fastMRI FLAIR (out-of-domain) | PSNR45.64 | 12 | |
| 3D volume fitting | AAPM | Parameter Count (M)10.83 | 7 | |
| MRI Reconstruction | fastMRI T2w Radial AF=10x | PSNR37.22 | 6 | |
| MRI Reconstruction | fastMRI T2w (Poisson, AF=20x) | PSNR36.95 | 6 |