Edit Transfer: Learning Image Editing via Vision In-Context Relations

About

We introduce a new setting, Edit Transfer, where a model learns a transformation from just a single source-target example and applies it to a new query image. While text-based image editing (TIE) methods excel at semantic manipulations through textual prompts, they often struggle with precise geometric details (e.g., pose and viewpoint changes). Reference-based image editing (RIE), on the other hand, typically focuses on style or appearance and fails at non-rigid transformations. By explicitly learning the editing transformation from a source-target pair, Edit Transfer mitigates the limitations of both text-only and appearance-centric references. Drawing inspiration from in-context learning in large language models, we propose a visual relation in-context learning paradigm, building upon a DiT-based text-to-image model. We arrange the edited example and the query image into a unified four-panel composite, then apply lightweight LoRA fine-tuning to capture complex spatial transformations from minimal examples. Despite using only 42 training samples, Edit Transfer substantially outperforms state-of-the-art TIE and RIE methods on diverse non-rigid scenarios, demonstrating the effectiveness of few-shot visual relation learning.
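
The four-panel arrangement mentioned in the abstract can be illustrated with a small sketch. This is not the authors' code: the function name `make_four_panel`, the panel order (source top-left, target top-right, query bottom-left, empty panel bottom-right), and the boolean mask marking the panel to be generated are all assumptions made for illustration, and the abstract does not specify them.

```python
import numpy as np

def make_four_panel(source, target, query, fill_value=0):
    """Tile three same-sized H x W x C images plus a blank panel into a 2x2 grid.

    Assumed (hypothetical) layout:
        [ source | target      ]
        [ query  | placeholder ]
    Returns the composite and a boolean mask that is True over the placeholder.
    """
    if not (source.shape == target.shape == query.shape):
        raise ValueError("source, target, and query must share the same shape")

    # Blank panel standing in for the edited query the model would produce.
    placeholder = np.full_like(query, fill_value)

    top = np.concatenate([source, target], axis=1)        # side by side
    bottom = np.concatenate([query, placeholder], axis=1)
    composite = np.concatenate([top, bottom], axis=0)     # stack the two rows

    h, w = query.shape[:2]
    mask = np.zeros(composite.shape[:2], dtype=bool)
    mask[h:, w:] = True                                   # bottom-right panel
    return composite, mask


# Example with small random arrays standing in for real images.
rng = np.random.default_rng(0)
src, tgt, qry = (rng.integers(0, 256, size=(4, 6, 3), dtype=np.uint8) for _ in range(3))
composite, mask = make_four_panel(src, tgt, qry)
```

Under these assumptions, a fine-tuned model would only need to fill the masked panel, with the top row serving as the in-context example of the edit.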

Lan Chen, Qi Mao, Yuchao Gu, Mike Zheng Shou • 2025

Related benchmarks

Task | Dataset | Result | Rank
Privacy-aware Image Editing (Multimodal Modality) | SPPE (test) | SSIM 0.7144 | 5
Privacy-aware Image Editing (Textual Modality) | SPPE (test) | SSIM 0.7483 | 5
Privacy-aware Image Editing (Visual Modality) | SPPE (test) | SSIM 0.76 | 5
Source Integrity | SPPE Visual (test) | SSIM 0.6368 | 5
Source Integrity | SPPE Textual (test) | SSIM 0.615 | 5
Source Integrity | SPPE Multimodal (test) | SSIM 0.616 | 5
Image Editing (Edit Consistency) | InstructP2P | SSIM 0.5251 | 4
Image Editing (Source Integrity) | InstructP2P | SSIM 0.532 | 4
Exemplar-based Image Editing | Human Preference Evaluation | Preference Score (Baseline) 6.77 | 4
Exemplar-based Image Editing | Relation seen tasks | CLIP-I 0.764 | 4
