Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose

About

Animal pose estimation is challenging for existing image-based methods because of limited training data and large intra- and inter-species variances. Motivated by the progress of visual-language research, we propose that pre-trained language models (e.g., CLIP) can facilitate animal pose estimation by providing rich prior knowledge for describing animal keypoints in text. However, we found that building effective connections between pre-trained language models and visual animal keypoints is non-trivial since the gap between text-based descriptions and keypoint-based visual features about animal pose can be significant. To address this issue, we introduce a novel prompt-based Contrastive learning scheme for connecting Language and AniMal Pose (CLAMP) effectively. The CLAMP attempts to bridge the gap by adapting the text prompts to the animal keypoints during network training. The adaptation is decomposed into spatial-aware and feature-aware processes, and two novel contrastive losses are devised correspondingly. In practice, the CLAMP enables the first cross-modal animal pose estimation paradigm. Experimental results show that our method achieves state-of-the-art performance under the supervised, few-shot, and zero-shot settings, outperforming image-based methods by a large margin.

Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, Dacheng Tao• 2022

Related benchmarks

TaskDatasetResultRank
Animal Pose EstimationAP-10K (test)
mAP77.8
55
Animal Pose EstimationAP-10K (val)
AP77.8
17
Animal Pose EstimationAP-10K 20-shot 1.0 (test)
AP61.2
5
Animal Pose EstimationAnimal-Pose (test)
AP74.3
5
Animal Pose EstimationAP-10K Equidae 1.0 (test)
AP46.6
4
Animal Pose EstimationAnimalKingdom Mammals Amphibians 1.0 (train test)
PCK@0.0512.3
4
Animal Pose EstimationAnimalKingdom Reptiles Amphibians 1.0 (train test)
PCK@0.0520.8
4
Animal Pose EstimationAnimalKingdom Fishes Birds 1.0 (train test)
PCK@0.059.5
4
Animal Pose EstimationAnimalKingdom (test)
PCK@0.05 (P1-All)65.6
4
Animal Pose EstimationAP-10K Canidae (test)
AP46.9
2
Showing 10 of 16 rows

Other info

Code

Follow for update