Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Orthogonal Negative Guidance in Attention Feature Space for Text-to-Image Generation

About

Text-to-image (T2I) models have become increasingly capable of generating high-quality images. Yet, enforcing the explicit absence of a specified object or attribute remains a fundamentally challenging problem. Existing approaches, including prompt negation, post-hoc editing, and negative guidance, remain insufficient for explicit concept suppression, often failing to remove the target concept or degrading overall image quality. To this end, we propose Orthogonal Negative Guidance in attention feature space, a training-free method that operates in the attention output space of MM-DiT-based T2I transformers. Our method orthogonalizes negative-prompt attention features with respect to positive-prompt features and subtracts only the orthogonal component, suppressing unwanted concepts while preserving desired semantics. Experiments on FLUX-dev and FLUX-schnell show that our method achieves favorable trade-offs between concept suppression, prompt alignment, and image quality. In human evaluation, our method outperforms the second-best baseline by 18.78%. We further show that our method supports multi-concept suppression and adjustable concept suppression.

Jungmin Ko, Jungwon Park, Jimyeong Kim, Changin Choi, Wonseok Lee, Wonjong Rhee• 2026

Related benchmarks

TaskDatasetResultRank
Negative Concept SuppressionLLM-generated prompts
Suppression Rate (%)84.25
10
Negative Concept SuppressionCOCO derived prompts
Suppression (%)86.25
10
Negative Concept SuppressionCombined LLM-generated + COCO-derived
Suppression Rate85.25
10
Negative Concept SuppressionDCS-Bench LLM-generated prompts on FLUX (dev)
Negative Concept Suppression (%)85.25
10
Negative Concept SuppressionDCS-Bench COCO-derived prompts FLUX (dev)
Suppression Rate (%)88.5
10
Negative Concept SuppressionDCS-Bench Combined FLUX (dev)
Negative Concept Suppression (%)86.88
10
Text-to-Image GenerationNVIDIA A6000 GPU Environment
Inference Time (s)46.73
9
Negative Concept SuppressionDCS-Bench
Human Preference Score70.81
5
Negative Concept SuppressionFlux (dev)
Negative Concept Suppression86.88
3
Showing 9 of 9 rows

Other info

Follow for update