Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Exploiting Spatiotemporal Properties for Efficient Event-Driven Human Pose Estimation

About

Human pose estimation focuses on predicting body keypoints to analyze human motion. Event cameras provide high temporal resolution and low latency, enabling robust estimation under challenging conditions. However, most existing methods convert event streams into dense event frames, which adds extra computation and sacrifices the high temporal resolution of the event signal. In this work, we aim to exploit the spatiotemporal properties of event streams based on point cloud-based framework, designed to enhance human pose estimation performance. We design Event Temporal Slicing Convolution module to capture short-term dependencies across event slices, and combine it with Event Slice Sequencing module for structured temporal modeling. We also apply edge enhancement in point cloud-based event representation to enhance spatial edge information under sparse event conditions to further improve performance. Experiments on the DHP19 dataset show our proposed method consistently improves performance across three representative point cloud backbones: PointNet, DGCNN, and Point Transformer.

Haoxian Zhou, Chuanzhi Xu, Langyi Chen, Haodong Chen, Yuk Ying Chung, Qiang Qu, Xaoming Chen, Weidong Cai• 2025

Related benchmarks

TaskDatasetResultRank
2D Human Pose EstimationDHP19
MPJPE2D6.38
6
3D Human Pose EstimationDHP19
MPJPE3D72.16
6
Showing 2 of 2 rows

Other info

Follow for update