
Evidential Deep Learning for Open Set Action Recognition

About

In a real-world scenario, human actions are typically out of the distribution of the training data, which requires a model to both recognize the known actions and reject the unknown. Compared with image data, video actions are more challenging to recognize in an open-set setting due to the uncertain temporal dynamics and the static bias of human actions. In this paper, we propose a Deep Evidential Action Recognition (DEAR) method to recognize actions in an open testing set. Specifically, we formulate the action recognition problem from the evidential deep learning (EDL) perspective and propose a novel model calibration method to regularize the EDL training. Besides, to mitigate the static bias of video representation, we propose a plug-and-play module to debias the learned representation through contrastive learning. Experimental results show that our DEAR method achieves consistent performance gains on multiple mainstream action recognition models and benchmarks. Code and pre-trained models are available at https://www.rit.edu/actionlab/dear.

Wentao Bao, Qi Yu, Yu Kong • 2021
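
As a rough illustration of the EDL formulation mentioned in the abstract, the sketch below maps classifier logits to Dirichlet evidence and derives a per-clip uncertainty that can be thresholded to reject unknown actions. It assumes PyTorch; the function and variable names are illustrative and not taken from the released DEAR code.

```python
# Minimal sketch of evidential deep learning (EDL) uncertainty for K action
# classes, assuming PyTorch. Names are illustrative, not the authors' code.
import torch
import torch.nn.functional as F

def edl_uncertainty(logits: torch.Tensor):
    """Map raw classifier logits to class probabilities and an uncertainty score.

    logits: (batch, K) outputs of an action recognition backbone.
    Returns (probs, uncertainty); uncertainty grows when total evidence is
    low, which is the cue used to reject unknown actions.
    """
    evidence = F.relu(logits)                   # non-negative evidence per class
    alpha = evidence + 1.0                      # Dirichlet concentration parameters
    strength = alpha.sum(dim=-1, keepdim=True)  # total evidence + K
    probs = alpha / strength                    # expected class probabilities
    k = torch.tensor(logits.shape[-1], dtype=logits.dtype)
    uncertainty = k / strength.squeeze(-1)      # vacuity: K / sum(alpha)
    return probs, uncertainty

# Usage: reject a clip as "unknown" when uncertainty exceeds a threshold.
if __name__ == "__main__":
    logits = torch.randn(4, 101)                # e.g. UCF-101 has 101 known classes
    probs, unc = edl_uncertainty(logits)
    is_unknown = unc > 0.5                      # threshold is a tunable choice
    print(probs.shape, unc.shape, is_unknown)
```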

Related benchmarks

Task | Dataset | Result | Rank
Action Recognition | UCF101 | - | 365
Object Detection | OOV-VOC (test) | mAP (IV): 57.9 | 13
Object Detection | OOV-COCO (test) | mAP (IV): 26.59 | 13
Out-of-Vocabulary Object Detection | OOV-COCO (test) | mAP (IV): 0.2659 | 13
Open Set Action Recognition | UCF-101 + HMDB-51 | Open Set AUC: 82.94 | 7
Open Set Action Recognition | UCF-101 + MiT-v2 | Open Set AUC: 86.99 | 7
Open Set Temporal Action Localization | THUMOS14 open set 1.0 (test) | FAR@95: 81.42 | 4
Open Set Temporal Action Localization | ActivityNet open set 1.3 (test) | FAR@95: 84.01 | 4
Temporal Action Localization | THUMOS14 closed set 1.0 (test) | mAP: 52.24 | 4
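For context on the "Open Set AUC" rows above, open-set action recognition is commonly scored by treating known vs. unknown clips as a binary detection problem and computing the ROC AUC of a per-clip uncertainty (or confidence) score. The sketch below, assuming scikit-learn and hypothetical score arrays, shows one such evaluation; it is not the benchmarks' official protocol.

```python
# Hedged sketch of an Open Set AUC style evaluation: known vs. unknown clips
# are separated by an uncertainty score (higher = more likely unknown).
# The arrays here are illustrative, not the official UCF-101 + HMDB-51 /
# MiT-v2 evaluation data.
import numpy as np
from sklearn.metrics import roc_auc_score

# Hypothetical uncertainty scores, e.g. from the EDL head sketched earlier.
unc_known = np.array([0.10, 0.25, 0.15, 0.30])    # clips from known classes
unc_unknown = np.array([0.70, 0.85, 0.60, 0.95])  # clips from unseen classes

scores = np.concatenate([unc_known, unc_unknown])
labels = np.concatenate([np.zeros_like(unc_known), np.ones_like(unc_unknown)])

# AUC of separating unknown (label 1) from known (label 0) by uncertainty.
print("Open-set ROC AUC:", roc_auc_score(labels, scores))
```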
