CLIPasso: Semantically-Aware Object Sketching

About

Abstraction is at the heart of sketching due to the simple and minimal nature of line drawings. Abstraction entails identifying the essential visual properties of an object or scene, which requires semantic understanding and prior knowledge of high-level concepts. Abstract depictions are therefore challenging for artists, and even more so for machines. We present CLIPasso, an object sketching method that can achieve different levels of abstraction, guided by geometric and semantic simplifications. While sketch generation methods often rely on explicit sketch datasets for training, we utilize the remarkable ability of CLIP (Contrastive-Language-Image-Pretraining) to distill semantic concepts from sketches and images alike. We define a sketch as a set of B\'ezier curves and use a differentiable rasterizer to optimize the parameters of the curves directly with respect to a CLIP-based perceptual loss. The abstraction degree is controlled by varying the number of strokes. The generated sketches demonstrate multiple levels of abstraction while maintaining recognizability, underlying structure, and essential visual components of the subject drawn.

Yael Vinker, Ehsan Pajouheshgar, Jessica Y. Bo, Roman Christian Bachmann, Amit Haim Bermano, Daniel Cohen-Or, Amir Zamir, Ariel Shamir• 2022

Related benchmarks

Task	Dataset	Result
Dynamic 3D Sketching	Synthetic and Real-world video scenes Fixed views	Structural Alignment (MS-SSIM)74	12
Image-guided portrait sketch generation	CelebA-HQ 1024x1024 (randomly sampled 50 images)	LPIPS0.572	9
Dynamic 3D Sketching	Synthetic and Real-world video scenes Novel views	MS-SSIM Structural Alignment0.78	8
Attribute Classification	STL-10	RC12.55	5
Stroke-based generation	STL-10 (test)	RC12.55	5
3D Motion Sketch Generation	User Study Novel views	Motion Score3.32	4
Dynamic 3D Sketching	Synthetic and Real-world video scenes Novel views (test)	Structural Alignment (MS-SSIM)76	4
Sketch Authenticity Assessment	Google Image and Synthesized Sketches User Study	Confusion Score0.39	4
3D Motion Sketch Generation	User Study Fixed views	Motion Quality Score2.87	4

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord