Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Self-supervised Video Representation Learning by Pace Prediction

About

This paper addresses the problem of self-supervised video representation learning from a new perspective -- by video pace prediction. It stems from the observation that human visual system is sensitive to video pace, e.g., slow motion, a widely used technique in film making. Specifically, given a video played in natural pace, we randomly sample training clips in different paces and ask a neural network to identify the pace for each video clip. The assumption here is that the network can only succeed in such a pace reasoning task when it understands the underlying video content and learns representative spatio-temporal features. In addition, we further introduce contrastive learning to push the model towards discriminating different paces by maximizing the agreement on similar video content. To validate the effectiveness of the proposed method, we conduct extensive experiments on action recognition and video retrieval tasks with several alternative network architectures. Experimental evaluations show that our approach achieves state-of-the-art performance for self-supervised video representation learning across different network architectures and different benchmarks. The code and pre-trained models are available at https://github.com/laura-wang/video-pace.

Jiangliu Wang, Jianbo Jiao, Yun-Hui Liu• 2020

Related benchmarks

TaskDatasetResultRank
Action RecognitionUCF101
Accuracy77.1
365
Action RecognitionUCF101 (mean of 3 splits)
Accuracy77.1
357
Action RecognitionUCF101 (test)
Accuracy77.1
307
Action RecognitionHMDB51 (test)
Accuracy0.366
249
Action RecognitionHMDB51
Top-1 Acc36.6
225
Action RecognitionHMDB-51 (average of three splits)
Top-1 Acc36.6
204
Video Action RecognitionUCF101
Top-1 Acc87.1
153
Action RecognitionUCF-101
Top-1 Acc87.1
147
Action ClassificationHMDB51 (over all three splits)
Accuracy36.6
121
Action RecognitionUCF101 (Split 1)
Top-1 Acc77.1
105
Showing 10 of 27 rows

Other info

Follow for update