Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DexFormer: Cross-Embodied Dexterous Manipulation via History-Conditioned Transformer

About

Dexterous manipulation remains one of the most challenging problems in robotics, requiring coherent control of high-DoF hands and arms under complex, contact-rich dynamics. A major barrier is embodiment variability: different dexterous hands exhibit distinct kinematics and dynamics, forcing prior methods to train separate policies or rely on shared action spaces with per-embodiment decoder heads. We present DexFormer, an end-to-end, dynamics-aware cross-embodiment policy built on a modified transformer backbone that conditions on historical observations. By using temporal context to infer morphology and dynamics on the fly, DexFormer adapts to diverse hand configurations and produces embodiment-appropriate control actions. Trained over a variety of procedurally generated dexterous-hand assets, DexFormer acquires a generalizable manipulation prior and exhibits strong zero-shot transfer to Leap Hand, Allegro Hand, and Rapid Hand. Our results show that a single policy can generalize across heterogeneous hand embodiments, establishing a scalable foundation for cross-embodiment dexterous manipulation. Project website: https://davidlxu.github.io/DexFormer-web/.

Ke Zhang, Lixin Xu, Chengyi Song, Junzhe Xu, Xiaoyi Lin, Zeyu Jiang, Renjing Xu• 2026

Related benchmarks

TaskDatasetResultRank
Robotic GraspingLEAP canonical
Success Rate0.8325
3
Robotic GraspingLEAP 32 variants (test)
Success Rate86.84
3
Robotic GraspingAllegro canonical
Success Rate74.19
3
Robotic GraspingAllegro 32 variants (test)
Success Rate71.94
3
Robotic GraspingRAPID canonical
Success Rate71.69
3
Robotic GraspingRAPID 32 variants (test)
Success Rate77.09
3
Showing 6 of 6 rows

Other info

Follow for update