
Localizing Knowledge in Diffusion Transformers

About

Understanding how knowledge is distributed across the layers of generative models is crucial for improving interpretability, controllability, and adaptation. While prior work has explored knowledge localization in UNet-based architectures, Diffusion Transformer (DiT)-based models remain underexplored in this context. In this paper, we propose a model- and knowledge-agnostic method to localize where specific types of knowledge are encoded within the DiT blocks. We evaluate our method on state-of-the-art DiT-based models, including PixArt-alpha, FLUX, and SANA, across six diverse knowledge categories. We show that the identified blocks are both interpretable and causally linked to the expression of knowledge in generated outputs. Building on these insights, we apply our localization framework to two key applications: model personalization and knowledge unlearning. In both settings, our localized fine-tuning approach enables efficient and targeted updates, reducing computational cost, improving task-specific performance, and better preserving general model behavior with minimal interference to unrelated or surrounding content. Overall, our findings offer new insights into the internal structure of DiTs and introduce a practical pathway for more interpretable, efficient, and controllable model editing.

Arman Zarei, Samyadeep Basu, Keivan Rezaei, Zihao Lin, Sayan Nag, Soheil Feizi • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| Nudity Erasure | I2P | Total Count: 356 | 38 |
| Utility Preservation | MS-COCO 10k | FID: 32.65 | 22 |
| Utility Preservation | COCO-10K (val) | FID: 32.65 | 20 |
| Concept Erasure | Curated Artistic Style | ACCe: 31.2 | 14 |
| Violence Erasure | I2P | Total: 649 | 12 |
| Concept Erasure | Abstraction Category color | ACCe: 28.6 | 10 |
| Celebrity Erasure | CelebA 100 identities | ACCe: 24.43 | 10 |
| Concept Erasure | Entity Category (e.g., church) | Accuracy: 17.5 | 10 |
| Concept Erasure | Ring-A-Bell (285 prompts) | Attack Success Rate (w/o ATTACK): 57.89 | 5 |
| Concept Erasure | I2P | Total Score: 306 | 4 |
Showing 10 of 12 rows
