SeedEdit: Align Image Re-Generation to Image Editing
About
We introduce SeedEdit, a diffusion model that can revise a given image following any text prompt. In our view, the key to this task is striking an optimal balance between preserving the original image (image reconstruction) and producing a new one (image re-generation). To this end, we start from a weak generator (a text-to-image model) that creates diverse pairs spanning these two directions, and gradually align it into a strong image editor that balances the two tasks well. SeedEdit achieves more diverse and stable editing than prior image editing methods, enabling sequential revisions of images generated by diffusion models.
Yichun Shi, Peng Wang, Weilin Huang • 2024
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Editing | GEdit-Bench-EN v1.0 (Full set) | G Score (SC) | 7.222 | 22 |
| Object Replacement and Style Blending | Object Replacement and Style Blending (800 pairs, test) | BOSM | 0.465 | 11 |
| Object Replacement and Object Blending | Unsplash 4,000 samples (test) | BOM | 0.5486 | 10 |
| Image Editing | GEdit-Bench-CN v1.0 (Full set) | G_SC (Generated Content Score) | 7.168 | 10 |
| Image Editing | GEdit-Bench-EN v1.0 (Intersection subset) | G_SC | 7.396 | 9 |
| Image Editing | GEdit-Bench-CN v1.0 (Intersection subset) | G_SC | 7.228 | 5 |
| Image Editing | GEdit-Bench (Intersection subset) | UP | 6.32 | 4 |
| Image Editing | GEdit-Bench (Full set) | UP | 5.678 | 4 |