ConceptPrism: Concept Disentanglement in Personalized Diffusion Models via Residual Token Optimization
About
Personalized text-to-image (T2I) generation has emerged as a key application for creating user-specific concepts from a few reference images. The core challenge is concept disentanglement: separating the target concept from irrelevant residual information. Without such disentanglement, methods that capture high-fidelity features often incorporate undesired attributes that conflict with user prompts, compromising the trade-off between concept fidelity and text alignment. Existing methods rely on manual guidance, which often fails to represent intricate visual details and does not scale. We introduce ConceptPrism, a framework that extracts shared features exclusively through cross-image comparison, without external information. We jointly optimize a target token and image-wise residual tokens via reconstruction and exclusion losses. By suppressing shared information in the residual tokens, the exclusion loss creates an information vacuum that forces the target token to capture the common concept. Extensive evaluations demonstrate that ConceptPrism achieves accurate concept disentanglement and significantly improves overall performance across diverse and complex visual concepts. The code is available at https://github.com/Minseo-Kimm/ConceptPrism.
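Below is a minimal sketch of the joint token optimization described above, not the authors' implementation. It assumes a frozen denoiser (stood in by a placeholder linear layer instead of a diffusion UNet), one learnable residual token per reference image, and a pairwise cosine-similarity form for the exclusion loss; the loss form, the 0.1 weight, and all dimensions are illustrative assumptions, not details from the paper.

```python
# Sketch of joint optimization of a target token and per-image residual tokens
# with a reconstruction loss and an "exclusion" loss that discourages residual
# tokens from encoding information shared across images.
import torch
import torch.nn.functional as F

num_images, token_dim = 4, 768  # assumed: one residual token per reference image
target_token = torch.randn(1, token_dim, requires_grad=True)              # shared concept token
residual_tokens = torch.randn(num_images, token_dim, requires_grad=True)  # image-specific tokens

# Placeholder for the frozen diffusion denoiser conditioned on the tokens.
denoiser = torch.nn.Linear(token_dim, token_dim)
for p in denoiser.parameters():
    p.requires_grad_(False)

optimizer = torch.optim.AdamW([target_token, residual_tokens], lr=1e-3)

def exclusion_loss(residuals: torch.Tensor) -> torch.Tensor:
    """Assumed form: penalize pairwise cosine similarity among residual tokens,
    suppressing content that is shared across images."""
    normed = F.normalize(residuals, dim=-1)
    sim = normed @ normed.T                        # (N, N) cosine similarities
    off_diag = sim - torch.diag(torch.diag(sim))   # drop self-similarity terms
    return off_diag.abs().mean()

for step in range(200):
    optimizer.zero_grad()
    # Reconstruction: each image is denoised under [target + its residual] conditioning.
    # A zero tensor stands in for the true denoising (epsilon-prediction) target.
    cond = target_token + residual_tokens          # (N, D) per-image conditioning
    pred = denoiser(cond)
    loss_rec = F.mse_loss(pred, torch.zeros_like(pred))
    loss_exc = exclusion_loss(residual_tokens)
    (loss_rec + 0.1 * loss_exc).backward()         # 0.1 is an arbitrary weight
    optimizer.step()
```

In an actual personalization pipeline, the placeholder reconstruction term would be the standard diffusion denoising loss on the reference images, with the optimized tokens injected into the text-encoder conditioning.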
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Personalized Image Generation | DreamBench (30 distinct personalized subjects) | CLIP-T: 0.357 | 7 |