OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation

About

We introduce a parameter-efficient adaptation method for panel-aware in-context image generation with pre-trained diffusion transformers. The key idea is to compose learnable, panel-specific orthogonal operators onto the backbone's frozen positional encodings. This design provides two desirable properties: (1) isometry, which preserves the geometry of internal features, and (2) same-panel invariance, which maintains the model's pre-trained intra-panel synthesis behavior. Through controlled experiments, we demonstrate that the effectiveness of our adaptation method is not tied to a specific positional encoding design but generalizes across diverse positional encoding regimes. By enabling effective panel-relative conditioning, the proposed method consistently improves in-context image-based instructional editing pipelines, including state-of-the-art approaches.
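
The two properties named above can be illustrated with a toy example. The following is a minimal sketch, not the authors' implementation: it assumes 2-D feature vectors, a RoPE-style frozen positional rotation, and plain rotation matrices standing in for the learnable panel operators. The names `PANEL_ANGLES`, `encode`, and the `freq` parameter are hypothetical and chosen only for illustration.

```python
import math

def rotation(theta):
    """Return a 2x2 rotation matrix, the simplest orthogonal operator."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s), (s, c))

def apply(mat, vec):
    """Matrix-vector product for small tuples."""
    return tuple(sum(m * v for m, v in zip(row, vec)) for row in mat)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    return math.sqrt(dot(a, a))

# Hypothetical per-panel angles; in a trained model these would be learned.
PANEL_ANGLES = {0: 0.0, 1: 0.7}

def encode(vec, position, panel, freq=0.1):
    """Apply the frozen positional rotation, then the panel operator."""
    frozen = apply(rotation(position * freq), vec)
    return apply(rotation(PANEL_ANGLES[panel]), frozen)

q, k = (1.0, 2.0), (0.5, -1.5)

# Attention logit with the frozen positional encoding only (no panel operator).
base = dot(apply(rotation(3 * 0.1), q), apply(rotation(5 * 0.1), k))

# Same panel: the panel operators cancel (O^T O = I), so the logit is unchanged.
same = dot(encode(q, 3, 1), encode(k, 5, 1))

# Different panels: the logit picks up a panel-relative term.
cross = dot(encode(q, 3, 0), encode(k, 5, 1))

# Isometry: an orthogonal operator preserves the vector norm.
norm_before, norm_after = norm(q), norm(encode(q, 3, 1))

print(math.isclose(same, base), math.isclose(cross, base))
print(math.isclose(norm_before, norm_after))
```

In this sketch, a query and key in the same panel produce the same logit as the frozen encoding alone, while a cross-panel pair does not, which is the sense in which the conditioning is panel-relative. Higher-dimensional learned operators would follow the same algebra, but their parameterization is not specified in this abstract.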

Sanghyeon Lee, Minwoo Lee, Euijin Shin, Kangyeol Kim, Seunghwan Choi, Jaegul Choo • 2026

Related benchmarks

Task	Dataset	Metric	Result	Rank
Instructive image editing	MagicBrush (test)	CLIP Image	0.9281	53
Subject-driven image generation	DreamBooth Dataset 1.0 (test)	DINO Score	0.6192	18
