Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deep Convolutional Ranking for Multilabel Image Annotation

About

Multilabel image annotation is one of the most important challenges in computer vision with many real-world applications. While existing work usually use conventional visual features for multilabel annotation, features based on Deep Neural Networks have shown potential to significantly boost performance. In this work, we propose to leverage the advantage of such features and analyze key components that lead to better performances. Specifically, we show that a significant performance gain could be obtained by combining convolutional architectures with approximate top-$k$ ranking objectives, as thye naturally fit the multilabel tagging problem. Our experiments on the NUS-WIDE dataset outperforms the conventional visual features by about 10%, obtaining the best reported performance in the literature.

Yunchao Gong, Yangqing Jia, Thomas Leung, Alexander Toshev, Sergey Ioffe• 2013

Related benchmarks

TaskDatasetResultRank
Multi-Label ClassificationNUS-WIDE
mAP3.1
38
Multi-label Image ClassificationNUS-WIDE 81 concept labels (test)
F1 (Class)33.5
29
Multi-label Image ClassificationMS-COCO (val)
F1 (C)55.7
25
Multi-Label ClassificationOpen Images (test)
mAP69.9
16
Multi-Label ClassificationNUS-WIDE 81 labels
Precision @ K=349.1
9
Image TaggingNUS-WIDE 81-tag (test)
MiAP48
7
Image TaggingIAPRTC-12 conventional image tagging 291 tags
mAP48
5
Showing 7 of 7 rows

Other info

Follow for update