Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Normalizing Flows for Human Pose Anomaly Detection

About

Video anomaly detection is an ill-posed problem because it relies on many parameters such as appearance, pose, camera angle, background, and more. We distill the problem to anomaly detection of human pose, thus decreasing the risk of nuisance parameters such as appearance affecting the result. Focusing on pose alone also has the side benefit of reducing bias against distinct minority groups. Our model works directly on human pose graph sequences and is exceptionally lightweight (~1K parameters), capable of running on any machine able to run the pose estimation with negligible additional resources. We leverage the highly compact pose representation in a normalizing flows framework, which we extend to tackle the unique characteristics of spatio-temporal pose data and show its advantages in this use case. The algorithm is quite general and can handle training data of only normal examples as well as a supervised setting that consists of labeled normal and abnormal examples. We report state-of-the-art results on two anomaly detection benchmarks - the unsupervised ShanghaiTech dataset and the recent supervised UBnormal dataset.

Or Hirschorn, Shai Avidan• 2022

Related benchmarks

TaskDatasetResultRank
Video Anomaly DetectionCUHK Avenue (Ave) (test)
AUC61.8
203
Video Anomaly DetectionShanghaiTech (test)
AUC0.859
194
Abnormal Event DetectionUCSD Ped2 (test)--
146
Abnormal Event DetectionUCSD Ped2--
132
Video Anomaly DetectionAvenue (test)
AUC (Micro)60.1
85
Video Anomaly DetectionCUHK Avenue
Frame AUC60.05
65
Video Anomaly DetectionShanghaiTech
Micro AUC0.859
51
Video Anomaly DetectionShanghaiTech standard (test)
Frame-Level AUC85.9
50
Video Anomaly DetectionUBnormal (test)
AUC71.8
37
Video Anomaly DetectionUBnormal
AUC79.2
25
Showing 10 of 26 rows

Other info

Code

Follow for update