Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Frequency-Enhanced Diffusion Models: Curriculum-Guided Semantic Alignment for Zero-Shot Skeleton Action Recognition

About

Human action recognition is pivotal in computer vision, with applications ranging from surveillance to human-robot interaction. Despite the effectiveness of supervised skeleton-based methods, their reliance on exhaustive annotation limits generalization to novel actions. Zero-Shot Skeleton Action Recognition (ZSAR) emerges as a promising paradigm, yet it faces challenges due to the spectral bias of diffusion models, which oversmooth high-frequency dynamics. Here, we propose Frequency-Aware Diffusion for Skeleton-Text Matching (FDSM), integrating a Semantic-Guided Spectral Residual Module, a Timestep-Adaptive Spectral Loss, and Curriculum-based Semantic Abstraction to address these challenges. Our approach effectively recovers fine-grained motion details, achieving state-of-the-art performance on NTU RGB+D, PKU-MMD, and Kinetics-skeleton datasets. Code has been made available at https://github.com/yuzhi535/FDSM. Project homepage: https://yuzhi535.github.io/FDSM.github.io/

Yuxi Zhou, Zhengbo Zhang, Jingyu Pan, Zhiyu Lin, Zhigang Tu• 2026

Related benchmarks

TaskDatasetResultRank
Skeleton Action RecognitionNTU-120 (96/24 random split)
Accuracy66.52
34
Skeleton Action RecognitionNTU-120 (110/10 random split)
Top-1 Accuracy75.24
24
Skeleton Action RecognitionNTU-60 (55/5 random split)--
23
Skeleton Action RecognitionNTU-60 (48/12 random split)--
15
Skeleton Action RecognitionNTU 60 (55/5 split)
Top-1 Accuracy90.13
12
Skeleton Action RecognitionPKU-MMD (46/5 split)
Top-1 Accuracy72.18
12
Skeleton-based Action RecognitionNTU-60 (40/20 split)
Top-1 Accuracy37.42
10
Skeleton-based Action RecognitionNTU-60 (30/30 split)
Top-1 Accuracy26.55
10
Skeleton-based Action RecognitionNTU-120 (80/40 split)
Top-1 Accuracy39.16
10
Skeleton-based Action RecognitionNTU-120 (60/60 split)
Top-1 Accuracy28.67
10
Showing 10 of 18 rows

Other info

Follow for update