Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Variational Prototyping-Encoder: One-Shot Learning with Prototypical Images

About

In daily life, graphic symbols, such as traffic signs and brand logos, are ubiquitously utilized around us due to its intuitive expression beyond language boundary. We tackle an open-set graphic symbol recognition problem by one-shot classification with prototypical images as a single training example for each novel class. We take an approach to learn a generalizable embedding space for novel tasks. We propose a new approach called variational prototyping-encoder (VPE) that learns the image translation task from real-world input images to their corresponding prototypical images as a meta-task. As a result, VPE learns image similarity as well as prototypical concepts which differs from widely used metric learning based approaches. Our experiments with diverse datasets demonstrate that the proposed VPE performs favorably against competing metric learning based one-shot methods. Also, our qualitative analyses show that our meta-task induces an effective embedding space suitable for unseen data representation.

Junsik Kim, Tae-Hyun Oh, Seokju Lee, Fei Pan, In So Kweon• 2019

Related benchmarks

TaskDatasetResultRank
One-shot ClassificationBelga to Flickr 32-way (test)
Accuracy53.53
10
One-shot ClassificationBelga to TopLogo 11-way (test)
Accuracy57.75
10
One-shot ClassificationGTSRB→GTSRB (22+21)-way
Accuracy83.79
6
One-shot ClassificationGTSRB→TT100K 36-way
Accuracy (%)71.8
5
One-shot ClassificationCTSD→CTSD (23+35)-way
Accuracy72.08
4
One-shot ClassificationBTSC→BTSC (21+41)-way
Accuracy78.46
4
Showing 6 of 6 rows

Other info

Follow for update