studentSplat: Your Student Model Learns Single-view 3D Gaussian Splatting
About
Recent advance in feed-forward 3D Gaussian splatting has enable remarkable multi-view 3D scene reconstruction or single-view 3D object reconstruction but single-view 3D scene reconstruction remain under-explored due to inherited ambiguity in single-view. We present \textbf{studentSplat}, a single-view 3D Gaussian splatting method for scene reconstruction. To overcome the scale ambiguity and extrapolation problems inherent in novel-view supervision from a single input, we introduce two techniques: 1) a teacher-student architecture where a multi-view teacher model provides geometric supervision to the single-view student during training, addressing scale ambiguity and encourage geometric validity; and 2) an extrapolation network that completes missing scene context, enabling high-quality extrapolation. Extensive experiments show studentSplat achieves state-of-the-art single-view novel-view reconstruction quality and comparable performance to multi-view methods at the scene level. Furthermore, studentSplat demonstrates competitive performance as a self-supervised single-view depth estimation method, highlighting its potential for general single-view 3D understanding tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Monocular Depth Estimation | DIODE | AbsRel40.7 | 93 | |
| Novel View Synthesis | ACID | PSNR26.94 | 51 | |
| Novel View Reconstruction | RE10K | PSNR24.98 | 12 | |
| Novel View Reconstruction | DTU cross-dataset | PSNR14.15 | 5 | |
| Novel View Reconstruction | ACID cross-dataset | PSNR26.59 | 5 | |
| Single-view depth estimation | DA-2K | Accuracy70.8 | 3 |