Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations

About

We present a system for learning generalizable hand-object tracking controllers purely from synthetic data, without requiring any human demonstrations. Our approach makes two key contributions: (1) HOP, a Hand-Object Planner, which can synthesize diverse hand-object trajectories; and (2) HOT, a Hand-Object Tracker that bridges synthetic-to-physical transfer through reinforcement learning and interaction imitation learning, delivering a generalizable controller conditioned on target hand-object states. Our method extends to diverse object shapes and hand morphologies. Through extensive evaluations, we show that our approach enables dexterous hands to track challenging, long-horizon sequences including object re-arrangement and agile in-hand reorientation. These results represent a significant step toward scalable foundation controllers for manipulation that can learn entirely from synthetic data, breaking the data bottleneck that has long constrained progress in dexterous manipulation.

Yinhuai Wang, Runyi Yu, Hok Wai Tsui, Xiaoyi Lin, Hui Zhang, Qihan Zhao, Ke Fan, Miao Li, Jie Song, Jingbo Wang, Qifeng Chen, Ping Tan• 2025

Related benchmarks

TaskDatasetResultRank
Average performance across meta skillsMeta Skills
Success Rate (SR)84.69
4
CatchMeta Skills
Success Rate83.59
4
GraspMeta Skills
Success Rate85.97
4
MoveMeta Skills
Success Rate (SR)98.87
4
PlaceMeta Skills
Success Rate94.33
4
RegraspMeta Skills
SR81.92
4
RotateMeta Skills
Success Rate56.23
4
ThrowMeta Skills
Success Rate (SR)91.94
4
Showing 8 of 8 rows

Other info

Follow for update