Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deep Feature Flow for Video Recognition

About

Deep convolutional neutral networks have achieved great success on image recognition tasks. Yet, it is non-trivial to transfer the state-of-the-art image recognition networks to videos as per-frame evaluation is too slow and unaffordable. We present deep feature flow, a fast and accurate framework for video recognition. It runs the expensive convolutional sub-network only on sparse key frames and propagates their deep feature maps to other frames via a flow field. It achieves significant speedup as flow computation is relatively fast. The end-to-end training of the whole architecture significantly boosts the recognition accuracy. Deep feature flow is flexible and general. It is validated on two recent large scale video datasets. It makes a large step towards practical video recognition.

Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei• 2016

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (val)
mIoU69.2
572
Video Object DetectionImageNet VID (val)
mAP (%)73.1
341
Video Semantic SegmentationCityscapes (val)
mIoU70.1
91
Semantic segmentationCamVid
mIoU66
61
Semantic Video SegmentationCityscapes (test)
mIoU68.7
24
Breast Lesion DetectionBLUVD-186 (test)
AP25.8
12
Semantic segmentationUAVid 8 semantic classes (val)
mIoU77.2
12
Semantic segmentationRuralScapes 12 semantic classes (val)
mIoU62.66
12
Showing 8 of 8 rows

Other info

Code

Follow for update