Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation

About

We introduce a parameter-efficient adaptation method for panel-aware in-context image generation with pre-trained diffusion transformers. The key idea is to compose learnable, panel-specific orthogonal operators onto the backbone's frozen positional encodings. This design provides two desirable properties: (1) isometry, which preserves the geometry of internal features, and (2) same-panel invariance, which maintains the model's pre-trained intra-panel synthesis behavior. Through controlled experiments, we demonstrate that the effectiveness of our adaptation method is not tied to a specific positional encoding design but generalizes across diverse positional encoding regimes. By enabling effective panel-relative conditioning, the proposed method consistently improves in-context image-based instructional editing pipelines, including state-of-the-art approaches.

Sanghyeon Lee, Minwoo Lee, Euijin Shin, Kangyeol Kim, Seunghwan Choi, Jaegul Choo• 2026

Related benchmarks

TaskDatasetResultRank
Instructive image editingMagicBrush (test)
CLIP Image0.9281
37
Subject-driven image generationDreamBooth Dataset 1.0 (test)
DINO Score0.6192
18
Showing 2 of 2 rows

Other info

Follow for update