
A Perceptual Prediction Framework for Self Supervised Event Segmentation

About

Temporal segmentation of long videos is an important problem that has largely been tackled through supervised learning, often requiring large amounts of annotated training data. In this paper, we tackle the problem of self-supervised temporal segmentation of long videos, which alleviates the need for any supervision. We introduce a self-supervised, predictive learning framework that draws inspiration from cognitive psychology to segment long, visually complex videos into individual, stable segments that share the same semantics. We also introduce a new adaptive learning paradigm that helps reduce the effect of catastrophic forgetting in recurrent neural networks. Extensive experiments on three publicly available datasets (Breakfast Actions, 50 Salads, and INRIA Instructional Videos) show the efficacy of the proposed approach. We show that the proposed approach outperforms weakly supervised and other unsupervised learning approaches by up to 24% and performs competitively with fully supervised approaches. We also show that the proposed approach learns highly discriminative features that help improve action recognition when used in a representation learning paradigm.

Sathyanarayanan N. Aakur, Sudeep Sarkar • 2018
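
The abstract describes segmenting a video with a predictive model but does not spell out the boundary rule. One common way to turn per-frame prediction errors into segments is to mark a boundary wherever the error jumps well above its recent average. The sketch below illustrates that idea only; the function name `detect_boundaries`, the `window` and `ratio` parameters, and the error values are all hypothetical and are not taken from the paper.

```python
from statistics import mean


def detect_boundaries(errors, window=5, ratio=1.5):
    """Return frame indices where the prediction error spikes.

    Assumption (not from the paper): a frame starts a new segment when its
    error exceeds `ratio` times the mean error of up to `window` preceding
    frames that belong to the current segment.
    """
    boundaries = []
    segment_start = 0  # first frame whose error counts toward the running mean
    for t in range(len(errors)):
        history = errors[max(segment_start, t - window):t]
        if len(history) < 2:
            continue  # too little context to judge a spike
        if errors[t] > ratio * mean(history):
            boundaries.append(t)
            segment_start = t + 1  # keep the spike itself out of the next mean
    return boundaries


# Made-up per-frame prediction errors with two obvious spikes.
errors = [0.10, 0.11, 0.10, 0.12, 0.10, 0.50,
          0.12, 0.11, 0.10, 0.11, 0.60, 0.12]
print(detect_boundaries(errors))  # [5, 10]
```

A ratio test is used here rather than a fixed threshold so that the rule adapts to the overall error scale of each video; other spike detectors would work equally well for illustration.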

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Action Segmentation | Breakfast | MoF 42.9 | 66 |
| Action Segmentation | Breakfast (test) | -- | 31 |
| Action Segmentation | Breakfast 14 | MoF 42.9 | 26 |
| Temporal action segmentation | 50 Salads granularity (Eval) | MoF 60.6 | 24 |
| Action Segmentation | Breakfast Action dataset | MoF 42.9 | 22 |
| Action Segmentation | YouTube Instructions (test) | F1 Score (%) 39.7 | 17 |
| Temporal Video Segmentation | Breakfast | MoF 0.429 | 14 |
| Action Segmentation | Youtube INRIA Instructional (YII) | F1 Score 39.7 | 11 |
| Video segmentation | INRIA Instructional Videos | F1 Score 39.7 | 10 |
| Activity Recognition | Breakfast Actions | Precision 37.87 | 8 |
Showing 10 of 13 rows
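
Most rows above report MoF (mean over frames), which is the fraction of frames whose predicted label matches the ground truth. The values 42.9 and 0.429 appear to be the same score on percent and fraction scales. Below is a minimal sketch of the metric. It assumes predicted labels have already been mapped to ground-truth label names, a step that unsupervised methods usually need before scoring; the function name and the example labels are hypothetical.

```python
def mean_over_frames(predicted, ground_truth):
    """Fraction of frames whose predicted label equals the ground-truth label.

    Assumption: both sequences use the same label vocabulary, i.e. any
    cluster-to-label matching has already been done.
    """
    if len(predicted) != len(ground_truth):
        raise ValueError("sequences must have the same number of frames")
    if not predicted:
        raise ValueError("sequences must not be empty")
    correct = sum(p == g for p, g in zip(predicted, ground_truth))
    return correct / len(predicted)


# Made-up five-frame example: frames 0, 2 and 3 are labeled correctly.
ground_truth = ["pour", "pour", "stir", "stir", "stir"]
predicted = ["pour", "stir", "stir", "stir", "pour"]
score = mean_over_frames(predicted, ground_truth)
print(f"MoF = {score:.3f} ({score * 100:.1f}%)")  # MoF = 0.600 (60.0%)
```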
