STRinGS: Selective Text Refinement in Gaussian Splatting
About
Text in the form of signs, labels, or instructions is a critical element of real-world scenes because it conveys important contextual information. 3D representations such as 3D Gaussian Splatting (3DGS) achieve high visual fidelity but struggle to preserve fine-grained text details, and small errors in reconstructed text can cause significant semantic loss. We propose STRinGS, a text-aware, selective refinement framework that addresses this issue for 3DGS reconstruction. Our method treats text and non-text regions separately, refining text regions first and then merging them with non-text regions for full-scene optimization. STRinGS produces sharp, readable text even in challenging configurations. To evaluate reconstruction quality on text regions, we introduce a text readability measure, OCR Character Error Rate (CER). STRinGS achieves a 63.6% relative improvement over 3DGS at just 7K iterations. We also introduce STRinGS-360, a curated dataset with diverse text scenarios for evaluating text readability in 3D reconstruction. Together, our method and dataset push the boundaries of 3D scene understanding in text-rich environments, paving the way for more robust text-aware reconstruction methods.
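As a rough illustration of how a character error rate and a relative improvement can be computed, the sketch below uses the common definition of CER: the Levenshtein edit distance between the reference text and the OCR output, divided by the reference length. This is a minimal sketch under stated assumptions, not the paper's evaluation code. The OCR engine, any text normalization, and the exact normalization of the score are not specified in the abstract, and the helper names `edit_distance`, `cer`, and `relative_improvement` are hypothetical. Since CER is an error rate (lower is better), relative improvement is assumed here to be the reduction in error divided by the baseline error.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum number of character insertions,
    deletions, and substitutions needed to turn hyp into ref."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # character of ref missing from hyp
                curr[j - 1] + 1,     # extra character in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate, normalized by reference length (assumed convention)."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_improvement(baseline: float, ours: float) -> float:
    """Fractional reduction of a lower-is-better metric (assumed definition)."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - ours) / baseline


# Toy example: OCR on a rendered view reads "EX1T" where the sign says "EXIT".
print(cer("EXIT", "EX1T"))  # one substitution over four characters: 0.25
```

Under this definition, a method that halves the baseline CER would report a 50% relative improvement. Whether the scores in the benchmark table are expressed as fractions or percentages is not stated in the source.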
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Text Reconstruction | TandT | CER | 9.9 | 12 |
| Text Reconstruction | DL3DV-10K | CER | 12.3 | 12 |
| Text Reconstruction | STRinGS-360 | CER | 0.106 | 12 |
| Novel View Synthesis | TandT (7K iterations) | Training Time (min) | 1.1 | 6 |
| Novel View Synthesis | TandT (30K iterations) | Training Time (min) | 9.6 | 6 |
| Novel View Synthesis | DL3DV-10K (7K iterations) | Training Time (min) | 2.1 | 6 |
| Novel View Synthesis | DL3DV-10K (30K iterations) | Training Time (min) | 11.4 | 6 |
| Novel View Synthesis | STRinGS-360 (7K iterations) | Training Time (min) | 1.9 | 6 |
| Novel View Synthesis | STRinGS-360 (30K iterations) | Training Time (min) | 12.6 | 6 |
| 3D Scene Reconstruction | STRinGS-360 | PSNR | 29 | 6 |