Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Geometry-aware Representations by Sketching

About

Understanding geometric concepts, such as distance and shape, is essential for understanding the real world and also for many vision tasks. To incorporate such information into a visual representation of a scene, we propose learning to represent the scene by sketching, inspired by human behavior. Our method, coined Learning by Sketching (LBS), learns to convert an image into a set of colored strokes that explicitly incorporate the geometric information of the scene in a single inference step without requiring a sketch dataset. A sketch is then generated from the strokes where CLIP-based perceptual loss maintains a semantic similarity between the sketch and the image. We show theoretically that sketching is equivariant with respect to arbitrary affine transformations and thus provably preserves geometric information. Experimental results show that LBS substantially improves the performance of object attribute classification on the unlabeled CLEVR dataset, domain transfer between CLEVR and STL-10 datasets, and for diverse downstream tasks, confirming that LBS provides rich geometric information.

Hyundo Lee, Inwoo Hwang, Hyunsung Go, Won-Seok Choi, Kibeom Kim, Byoung-Tak Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationSTL-10 (test)
Accuracy56.48
357
Fine-Grained Sketch-Based Image Retrieval (FG-SBIR)Shoe V2 (test)
Recall@140.8
63
Bottommost object color inference (BC)CLEVR
BC Accuracy84.09
13
Leftmost object color inference (LC)CLEVR
Accuracy81.79
13
Rightmost object material inferenceCLEVR
Accuracy86.84
13
Rightmost object shape inferenceCLEVR
Accuracy70.03
13
Rightmost object size inferenceCLEVR
Accuracy93.22
13
Third object from right color inferenceCLEVR
Accuracy38.23
13
Rightmost object color inference (RC)CLEVR
Accuracy97.49
13
Shifted rightmost object color inferenceCLEVR
Accuracy (Shifted Rightmost Color)51.56
13
Showing 10 of 17 rows

Other info

Follow for update