Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

About

Self-supervised learning has gained prominence due to its efficacy at learning powerful representations from unlabelled data that achieve excellent performance on many challenging downstream tasks. However, supervision-free pre-text tasks are challenging to design and are usually modality-specific. Although there is a rich literature of self-supervised methods for either spatial modalities (such as images) or temporal modalities (such as sound or text), a common pre-text task that benefits both is largely missing. In this paper, we are interested in defining a self-supervised pre-text task for sketch and handwriting data. This data is uniquely characterised by its existence in dual modalities: rasterized images and vector coordinate sequences. We address and exploit this dual representation by proposing two novel cross-modal translation pre-text tasks for self-supervised feature learning: Vectorization and Rasterization. Vectorization learns to map image space to vector coordinates, and Rasterization learns to map vector coordinates to image space. We show that our learned encoder modules benefit both raster-based and vector-based downstream approaches to analysing hand-drawn data. Empirical evidence shows that our novel pre-text tasks surpass existing single-modal and multi-modal self-supervision methods.
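To make the dual representation concrete, here is a minimal sketch of the two data forms the abstract refers to: a vector coordinate sequence and the raster image it corresponds to. This is a fixed, hand-written rendering routine for illustration only, not the learned Rasterization model from the paper. The `rasterize` function, the `(x, y, pen_down)` point convention, and the grid size are all assumptions introduced here.

```python
from typing import List, Tuple

# Hypothetical point format (an assumption, not the paper's encoding):
# x and y lie in [0, 1]; pen_down == 1 means "draw a line from this
# point to the next one", pen_down == 0 means the pen is lifted.
Point = Tuple[float, float, int]


def rasterize(points: List[Point], size: int = 5) -> List[List[int]]:
    """Render a vector stroke sequence onto a size x size binary grid."""
    grid = [[0] * size for _ in range(size)]
    scale = size - 1
    for (x0, y0, pen_down), (x1, y1, _) in zip(points, points[1:]):
        if not pen_down:
            continue  # pen lifted: move without drawing
        # Sample the segment densely enough that no pixel is skipped.
        longest = max(abs(x1 - x0), abs(y1 - y0)) * scale
        n = max(1, int(round(longest)) * 2)
        for i in range(n + 1):
            t = i / n
            col = int(round((x0 + t * (x1 - x0)) * scale))
            row = int(round((y0 + t * (y1 - y0)) * scale))
            # Clamp so slightly out-of-range inputs cannot index off-grid.
            col = min(max(col, 0), scale)
            row = min(max(row, 0), scale)
            grid[row][col] = 1
    return grid


# Vector modality: a unit square drawn as one closed stroke.
square: List[Point] = [
    (0.0, 0.0, 1),
    (1.0, 0.0, 1),
    (1.0, 1.0, 1),
    (0.0, 1.0, 1),
    (0.0, 0.0, 0),
]

# Raster modality: the same sketch as a binary image.
image = rasterize(square, size=5)
for row in image:
    print("".join("#" if px else "." for px in row))
```

The vector form keeps stroke order and pen state, while the raster form keeps only spatial layout. Vectorization, as described in the abstract, is the harder inverse direction of recovering a coordinate sequence from the image, which is why it is learned rather than computed by a fixed routine like the one above.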

Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song • 2021

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) | Chair V2 (test) | Top-1 Accuracy: 60.2 | 72 |
| Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) | Shoe V2 (test) | Recall@1: 39.1 | 63 |
| Sketch Recognition | QuickDraw (test) | Top-1 Acc: 65.6 | 34 |
| Sketch Retrieval | QuickDraw (test) | A@T1: 60.4 | 34 |
| Recognition | QuickDraw Image Space | Top-1 Accuracy: 71.9 | 13 |
| Recognition | TU-Berlin Image Space | Top-1 Accuracy: 70.6 | 13 |
| Retrieval | QuickDraw Image Space | A@T1: 52.3 | 13 |
| Retrieval | TU-Berlin Image Space | A@T1: 47.7 | 13 |
| Sketch Recognition | QuickDraw Image Space (test) | Top-1 Acc: 65.1 | 7 |
| Handwriting Recognition | IAM Online | Accuracy: 56.7 | 6 |
Showing 10 of 21 rows
