
Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting

About

We present Stylos, a single-forward 3D Gaussian framework for 3D style transfer that operates on unposed content, from a single image to a multi-view collection, conditioned on a separate reference style image. Stylos synthesizes a stylized 3D Gaussian scene without per-scene optimization or precomputed poses, achieving geometry-aware, view-consistent stylization that generalizes to unseen categories, scenes, and styles. At its core, Stylos adopts a Transformer backbone with two pathways: geometry predictions retain self-attention to preserve geometric fidelity, while style is injected via global cross-attention to enforce visual consistency across views. With the addition of a voxel-based 3D style loss that aligns aggregated scene features to style statistics, Stylos enforces view-consistent stylization while preserving geometry. Experiments across multiple datasets demonstrate that Stylos delivers high-quality zero-shot stylization, highlighting the effectiveness of global style-content coupling, the proposed 3D style loss, and the scalability of our framework from single-view to large-scale multi-view settings. Our code is available at https://github.com/HanzhouLiu/Stylos.
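
The idea of a voxel-based style loss that "aligns aggregated scene features to style statistics" can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that per-point features are averaged within each voxel and that the "style statistics" are channel-wise mean and standard deviation, a common choice in style transfer. The function names `voxelize_mean`, `channel_stats`, and `style_stat_loss` are hypothetical and do not come from the Stylos repository.

```python
from collections import defaultdict
from math import floor, sqrt


def voxelize_mean(points, feats, voxel_size):
    """Average the feature vectors of all points that fall in the same voxel.

    Assumption: mean pooling per voxel; the paper may aggregate differently.
    """
    buckets = defaultdict(list)
    for (x, y, z), f in zip(points, feats):
        key = (floor(x / voxel_size), floor(y / voxel_size), floor(z / voxel_size))
        buckets[key].append(f)
    # One averaged feature vector per occupied voxel, in first-seen order.
    return [
        [sum(col) / len(group) for col in zip(*group)]
        for group in buckets.values()
    ]


def channel_stats(feats, eps=1e-8):
    """Per-channel mean and standard deviation over a list of feature vectors."""
    n = len(feats)
    columns = list(zip(*feats))
    means = [sum(col) / n for col in columns]
    stds = [
        sqrt(sum((v - m) ** 2 for v in col) / n + eps)
        for col, m in zip(columns, means)
    ]
    return means, stds


def style_stat_loss(scene_feats, style_feats):
    """Squared distance between channel-wise mean/std of two feature sets.

    Assumption: mean/std matching stands in for the paper's style statistics.
    """
    mu_c, sd_c = channel_stats(scene_feats)
    mu_s, sd_s = channel_stats(style_feats)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_c, mu_s))
    std_term = sum((a - b) ** 2 for a, b in zip(sd_c, sd_s))
    return mean_term + std_term


# Toy usage: three 3D points with 2-channel features, voxel edge length 1.0.
points = [(0.1, 0.1, 0.1), (0.2, 0.2, 0.2), (1.5, 0.1, 0.1)]
feats = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0]]
voxel_feats = voxelize_mean(points, feats, 1.0)
style_feats = [[0.0, 0.0], [2.0, 2.0]]
loss = style_stat_loss(voxel_feats, style_feats)
```

Because the statistics are computed once over voxels in 3D rather than separately per rendered view, every view is pushed toward the same style target, which is one plausible reading of how such a loss encourages view consistency.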

Hanzhou Liu, Jia Huang, Mi Lu, Srikanth Saripalli, Peng Jiang • 2025

Related benchmarks

Task                                Dataset                            Result              Rank
3D Stylization                      TnT Truck scene                    ArtScore 9.7        15
3D Stylization                      TnT (M60 scene)                    ArtScore 9.37       15
Multi-view consistency              Garden scene short-range AnyStyle  LPIPS 0.047         11
Multi-view consistency              Truck scene Short-range AnyStyle   LPIPS 0.028         11
Multi-view consistency              M60 scene AnyStyle (short-range)   LPIPS 0.035         11
Multi-view consistency              AnyStyle Scene Long-range (train)  LPIPS 0.051         11
Multi-view consistency              Truck scene Long-range AnyStyle    LPIPS 0.074         11
Multi-view consistency              M60 scene Long-range AnyStyle      LPIPS 0.083         11
Multi-view consistency              Garden scene Long-range AnyStyle   LPIPS 0.139         11
Short-range Multi-view Consistency  Tanks and Temples short-range      Average LPIPS 0.03  11
Showing 10 of 20 rows
