Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Temporal Recurrent Networks for Online Action Detection

About

Most work on temporal action detection is formulated as an offline problem, in which the start and end times of actions are determined after the entire video is fully observed. However, important real-time applications including surveillance and driver assistance systems require identifying actions as soon as each video frame arrives, based only on current and historical observations. In this paper, we propose a novel framework, Temporal Recurrent Network (TRN), to model greater temporal context of a video frame by simultaneously performing online action detection and anticipation of the immediate future. At each moment in time, our approach makes use of both accumulated historical evidence and predicted future information to better recognize the action that is currently occurring, and integrates both of these into a unified end-to-end architecture. We evaluate our approach on two popular online action detection datasets, HDD and TVSeries, as well as another widely used dataset, THUMOS'14. The results show that TRN significantly outperforms the state-of-the-art.

Mingze Xu, Mingfei Gao, Yi-Ting Chen, Larry S. Davis, David J. Crandall• 2018

Related benchmarks

TaskDatasetResultRank
Temporal Action DetectionTHUMOS-14 (test)--
330
Online Action DetectionTHUMOS14 (test)
mAP68.5
86
Action DetectionTHUMOS 2014 (test)--
79
Online Action DetectionTVSeries
mcAP86.2
57
Online Action DetectionTVSeries (test)
mcAP86.2
41
Online Action DetectionTHUMOS 14
Mean F-AP62.1
37
Online Action DetectionHDD
Overall mAP40.8
29
Action AnticipationTVSeries (test)
mcAP75.7
22
Action AnticipationTHUMOS-14 (test)
mAP38.9
14
Action AnticipationTHUMOS 2014
mAP (Avg)38.9
14
Showing 10 of 17 rows

Other info

Follow for update