Turbo Training with Token Dropout

About

The objective of this paper is an efficient training method for video tasks. We make three contributions: (1) We propose Turbo training, a simple and versatile training paradigm for Transformers on multiple video tasks. (2) We illustrate the advantages of Turbo training on action classification, video-language representation learning, and long-video activity classification, showing that Turbo training can largely maintain competitive performance while achieving almost 4X speed-up and significantly less memory consumption. (3) Turbo training enables long-schedule video-language training and end-to-end long-video training, delivering competitive or superior performance than previous works, which were infeasible to train under limited resources.

Tengda Han, Weidi Xie, Andrew Zisserman• 2022

Related benchmarks

Task	Dataset	Result
Action Recognition	Breakfast	Top-1 Accuracy91.3	28
Video Classification	COIN (test)	Top-1 Accuracy87.5	20
Action Recognition	COIN	Top-1 Acc87.5	12

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord