Borrowing from anything: A generalizable framework for reference-guided instance editing

About

Reference-guided instance editing is fundamentally limited by semantic entanglement, where a reference's intrinsic appearance is intertwined with its extrinsic attributes. The key challenge lies in disentangling what information should be borrowed from the reference and determining how to apply it appropriately to the target. To tackle this challenge, we propose GENIE, a Generalizable Instance Editing framework capable of achieving explicit disentanglement. GENIE first corrects spatial misalignments with a Spatial Alignment Module (SAM). Then, an Adaptive Residual Scaling Module (ARSM) learns what to borrow by amplifying salient intrinsic cues and suppressing extrinsic attributes, and a Progressive Attention Fusion (PAF) mechanism learns how to render this appearance onto the target while preserving the target's structure. Extensive experiments on the challenging AnyInsertion dataset demonstrate that GENIE achieves state-of-the-art fidelity and robustness, setting a new standard for disentanglement-based instance editing.

Shengxiao Zhou, Chenghua Li, Jianhao Huang, Qinghao Hu, Yifan Zhang • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Insertion | AnyInsertion Object 512x512 (test) | PSNR 26.3391 | 5 |
| Image Insertion | AnyInsertion Garment 512x512 (test) | PSNR 23.8932 | 5 |
| Image Insertion | AnyInsertion Person 512x512 (test) | PSNR 24.1848 | 5 |
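
The results above are reported as PSNR (peak signal-to-noise ratio, in dB; higher is better). This page does not say how the metric was evaluated (pixel range, color space, or whether only the edited region is scored), so the following is only a minimal sketch of the standard definition, PSNR = 10 * log10(MAX^2 / MSE). It assumes 8-bit pixel values flattened into plain Python sequences; the function name `psnr` and the `max_value` parameter are illustrative and do not come from the paper.

```python
import math


def psnr(reference, candidate, max_value=255.0):
    """Peak signal-to-noise ratio (dB) between two equal-length pixel sequences.

    Assumption: pixels are flattened into 1-D sequences of numbers whose
    maximum possible value is `max_value` (255 for 8-bit images).
    """
    if len(reference) != len(candidate):
        raise ValueError("inputs must contain the same number of pixel values")
    if len(reference) == 0:
        raise ValueError("inputs must be non-empty")

    # Mean squared error over all pixel values.
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)

    if mse == 0:
        # Identical inputs: PSNR is unbounded.
        return math.inf

    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    ground_truth = [10, 20, 30, 40]
    edited = [12, 18, 30, 40]
    print(f"PSNR: {psnr(ground_truth, edited):.2f} dB")
```

In practice the two sequences would be the flattened ground-truth and generated 512x512 images. Scores are only comparable across methods when the same pixel range and evaluation region are used.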