EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images
About
Feed-forward 3D Gaussian Splatting (3DGS) enables efficient one-pass scene reconstruction, providing 3D representations for novel view synthesis without per-scene optimization. However, existing methods typically predict pixel-aligned primitives per-view, producing an excessive number of primitives in dense-view settings and offering no explicit control over the number of predicted Gaussians. To address this, we propose EcoSplat, the first efficiency-controllable feed-forward 3DGS framework that adaptively predicts the 3D representation for any given target primitive count at inference time. EcoSplat adopts a two-stage optimization process. The first stage is Pixel-aligned Gaussian Training (PGT) where our model learns initial primitive prediction. The second stage is Importance-aware Gaussian Finetuning (IGF) stage where our model learns rank primitives and adaptively adjust their parameters based on the target primitive count. Extensive experiments across multiple dense-view settings show that EcoSplat is robust and outperforms state-of-the-art methods under strict primitive-count constraints, making it well-suited for flexible downstream rendering tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Re10K (test) | PSNR25.21 | 66 | |
| Novel View Synthesis | RE10K challenging views (test) | PSNR25.02 | 56 | |
| Novel View Synthesis | ACID 20 (test) | PSNR24.12 | 14 | |
| Novel View Synthesis | RE10K 16 views (test) | PSNR25.27 | 9 | |
| Novel View Synthesis | RE10K 24 views (test) | PSNR25.11 | 9 | |
| Novel View Synthesis | RE10K 24-view setting | Recon Latency (s)0.52 | 9 | |
| Novel View Synthesis | ACID 16-view (test) | PSNR24.17 | 9 | |
| Novel View Synthesis | ACID zero-shot | PSNR24.02 | 9 |