Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Frequency-Guided Action Diffusion via Sub-Frequency Manifold Traversal

About

Learning visuomotor policies via behavior cloning typically involves mimicking expert demonstrations collected by human operators. However, natural human demonstrations inherently contain high-frequency noise, such as intermittent jerks, pauses, and action jitter. Training policies to directly imitate these raw trajectories inevitably causes the model to inherit these suboptimal behaviors. This pathology is particularly pronounced in diffusion-based policies, where iterative denoising steps can inadvertently amplify high-frequency artifacts at the expense of meaningful fine-grained details. To address these limitations, we present a novel frequency-based algorithm that enables implicit spectral maneuvering and smooth action generation. Our method, Frequency Guidance Operator (FGO), steers the generation process of diffusion polices by progressively driving the noisy samples through intermediate sub-frequency manifolds with expanding spectral bands. Validated on 15 robotic manipulation tasks from 5 benchmarks, FGO achieves superior performance in enhancing action smoothness and temporal consistency while preserving the details necessary for successful task execution. Project website: https://henrywjl.github.io/frequency-guidance-operator/

Junlin Wang• 2026

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationDexArt
Success Rate (Bucket)25.3
29
Dexterous ManipulationAdroit
Hammer Success100
17
Visuomotor ControlRoboSuite
Lift Success Rate92.7
4
Visuomotor ControlMimicGen
Success Rate (Three Piece Assembly)39.3
4
Visuomotor ControlRobosuite Can (test)
ATV14.76
4
Showing 5 of 5 rows

Other info

GitHub

Follow for update