Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Coherent Temporal Synthesis for Incremental Action Segmentation

About

Data replay is a successful incremental learning technique for images. It prevents catastrophic forgetting by keeping a reservoir of previous data, original or synthesized, to ensure the model retains past knowledge while adapting to novel concepts. However, its application in the video domain is rudimentary, as it simply stores frame exemplars for action recognition. This paper presents the first exploration of video data replay techniques for incremental action segmentation, focusing on action temporal modeling. We propose a Temporally Coherent Action (TCA) model, which represents actions using a generative model instead of storing individual frames. The integration of a conditioning variable that captures temporal coherence allows our model to understand the evolution of action features over time. Therefore, action segments generated by TCA for replay are diverse and temporally coherent. In a 10-task incremental setup on the Breakfast dataset, our approach achieves significant increases in accuracy for up to 22% compared to the baselines.

Guodong Ding, Hans Golong, Angela Yao• 2024

Related benchmarks

TaskDatasetResultRank
Temporal action segmentation50Salads
Accuracy66.8
106
Temporal action segmentationGTEA
F1 Score @ 10% Threshold63
99
Temporal action segmentationBreakfast
Accuracy36.6
96
Action SegmentationBreakfast 10 tasks (test)
Acc36
16
Action SegmentationBreakfast blurry task boundary
Acc44.2
8
Action SegmentationBreakfast 5 tasks (test)
Accuracy57.2
8
Action SegmentationYouTube Instructional 5 tasks (test)
Accuracy0.302
8
Showing 7 of 7 rows

Other info

Follow for update