Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

About

We present SliderSpace, a framework for automatically decomposing the visual capabilities of diffusion models into controllable and human-understandable directions. Unlike existing control methods that require a user to specify attributes for each edit direction individually, SliderSpace discovers multiple interpretable and diverse directions simultaneously from a single text prompt. Each direction is trained as a low-rank adaptor, enabling compositional control and the discovery of surprising possibilities in the model's latent space. Through extensive experiments on state-of-the-art diffusion models, we demonstrate SliderSpace's effectiveness across three applications: concept decomposition, artistic style exploration, and diversity enhancement. Our quantitative evaluation shows that SliderSpace-discovered directions decompose the visual structure of model's knowledge effectively, offering insights into the latent capabilities encoded within diffusion models. User studies further validate that our method produces more diverse and useful variations compared to baselines. Our code, data and trained weights are available at https://sliderspace.baulab.info

Rohit Gandikota, Zongze Wu, Richard Zhang, David Bau, Eli Shechtman, Nick Kolkin• 2025

Related benchmarks

TaskDatasetResultRank
Slider ControllabilityFreeSliders (evaluation set)
Range (CR)45.1
7
Video EditingEditVerse
Edit Quality3.743
7
Slider-based Video EditingUser Study Appearance and Motion Sliders
Editing Quality Score2.89
7
Image GenerationReference Manifold Car (test)
Precision16.73
5
Image GenerationReference Manifold Dog (test)
Precision19.01
5
Image GenerationReference Manifold Person (test)
Precision10.19
5
Image GenerationSDXL Generated Concepts Cat (test)
FID21.69
5
Image GenerationGenerated Concepts Person SDXL (test)
FID41.52
5
Image GenerationReference Manifold Cat (test)
Precision0.1349
5
Image GenerationReference Manifold Monster (test)
Precision6.49
5
Showing 10 of 14 rows

Other info

Follow for update