Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning

About

Video learning is an important task in computer vision and has experienced increasing interest over the recent years. Since even a small amount of videos easily comprises several million frames, methods that do not rely on a frame-level annotation are of special importance. In this work, we propose a novel learning algorithm with a Viterbi-based loss that allows for online and incremental learning of weakly annotated video data. We moreover show that explicit context and length modeling leads to huge improvements in video segmentation and labeling tasks andinclude these models into our framework. On several action segmentation benchmarks, we obtain an improvement of up to 10% compared to current state-of-the-art methods.

Alexander Richard, Hilde Kuehne, Ahsan Iqbal, Juergen Gall• 2018

Related benchmarks

TaskDatasetResultRank
Temporal action segmentation50Salads
Accuracy78.7
106
Temporal action segmentationBreakfast
Accuracy74.1
96
Action SegmentationBreakfast
MoF43
66
Action SegmentationBreakfast (test)
MoF43
31
Action SegmentationCOIN
Frame Accuracy21.2
29
Action SegmentationBreakfast 14
MoF43
26
Action SegmentationCOIN (test)
Frame Accuracy21.2
23
Action SegmentationBreakfast Action dataset
MoF43
22
Action Segmentation50Salads mid granularity
MoF49.4
19
Action AlignmentHollywood Extended
IoD48.7
15
Showing 10 of 22 rows

Other info

Code

Follow for update