Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Turbo Training with Token Dropout

About

The objective of this paper is an efficient training method for video tasks. We make three contributions: (1) We propose Turbo training, a simple and versatile training paradigm for Transformers on multiple video tasks. (2) We illustrate the advantages of Turbo training on action classification, video-language representation learning, and long-video activity classification, showing that Turbo training can largely maintain competitive performance while achieving almost 4X speed-up and significantly less memory consumption. (3) Turbo training enables long-schedule video-language training and end-to-end long-video training, delivering competitive or superior performance than previous works, which were infeasible to train under limited resources.

Tengda Han, Weidi Xie, Andrew Zisserman• 2022

Related benchmarks

TaskDatasetResultRank
Action RecognitionBreakfast
Top-1 Accuracy91.3
28
Video ClassificationCOIN (test)
Top-1 Accuracy87.5
20
Action RecognitionCOIN
Top-1 Acc87.5
12
Showing 3 of 3 rows

Other info

Follow for update