Spatiotemporal Residual Networks for Video Action Recognition

About

Two-stream Convolutional Networks (ConvNets) have shown strong performance for human action recognition in videos. Recently, Residual Networks (ResNets) have arisen as a new technique to train extremely deep architectures. In this paper, we introduce spatiotemporal ResNets as a combination of these two approaches. Our novel architecture generalizes ResNets for the spatiotemporal domain by introducing residual connections in two ways. First, we inject residual connections between the appearance and motion pathways of a two-stream architecture to allow spatiotemporal interaction between the two streams. Second, we transform pretrained image ConvNets into spatiotemporal networks by equipping these with learnable convolutional filters that are initialized as temporal residual connections and operate on adjacent feature maps in time. This approach slowly increases the spatiotemporal receptive field as the depth of the model increases and naturally integrates image ConvNet design principles. The whole model is trained end-to-end to allow hierarchical learning of complex spatiotemporal features. We evaluate our novel spatiotemporal ResNet using two widely used action recognition benchmarks where it exceeds the previous state-of-the-art.

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes• 2016

Related benchmarks

Task	Dataset	Result
Action Recognition	UCF101	Accuracy94.6	433
Action Recognition	UCF101 (mean of 3 splits)	Accuracy94.6	357
Action Recognition	HMDB-51 (average of three splits)	Top-1 Acc70.3	204
Action Recognition	HMDB51	3-Fold Accuracy70.3	191
Action Recognition	UCF101 (3 splits)	Accuracy93.4	155
Action Classification	HMDB51 (over all three splits)	Accuracy48.9	121
Action Recognition	HMDB51 (split 1)	--	80
Video Action Recognition	HMDB51 (avg over all splits)	Top-1 Acc70.3	56
Video Classification	UCF101 (averaged over three splits)	Accuracy94.6	39
Action Recognition	HMDB-51 v1	Accuracy66.4	31

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord