ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
About
Our world is marked by unprecedented technological, global, and socio-political transformations, posing a significant challenge to text-to-image generative models. These models encode factual associations within their parameters that can quickly become outdated, diminishing their utility for end-users. To that end, we introduce ReFACT, a novel approach for editing factual associations in text-to-image models without relaying on explicit input from end-users or costly re-training. ReFACT updates the weights of a specific layer in the text encoder, modifying only a tiny portion of the model's parameters and leaving the rest of the model unaffected. We empirically evaluate ReFACT on an existing benchmark, alongside a newly curated dataset. Compared to other methods, ReFACT achieves superior performance in both generalization to related concepts and preservation of unrelated concepts. Furthermore, ReFACT maintains image generation quality, making it a practical tool for updating and correcting factual information in text-to-image models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Backdoor Detoxification | MS-COCO BadT2I_sent | FID (Clean)49.51 | 6 | |
| Backdoor Detoxification | MS-COCO BadT2I_tok | FID (Clean)53.12 | 6 | |
| Backdoor Detoxification | MS-COCO PersonalBKD_dream | FID (Clean)141.1 | 6 | |
| Backdoor Detoxification | MS-COCO Average across attacks | FID (Clean)98.01 | 6 | |
| Backdoor Detoxification | MS-COCO EvilEdit | FID (Clean)148.3 | 6 |