Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking

About

Active Object Tracking (AOT) aims to maintain a specific relation between the tracker and object(s) by autonomously controlling the motion system of a tracker given observations. AOT has wide-ranging applications, such as in mobile robots and autonomous driving. However, building a generalizable active tracker that works robustly across different scenarios remains a challenge, especially in unstructured environments with cluttered obstacles and diverse layouts. We argue that constructing a state representation capable of modeling the geometry structure of the surroundings and the dynamics of the target is crucial for achieving this goal. To address this challenge, we present RSPT, a framework that forms a structure-aware motion representation by Reconstructing the Surroundings and Predicting the target Trajectory. Additionally, we enhance the generalization of the policy network by training in an asymmetric dueling mechanism. We evaluate RSPT on various simulated scenarios and show that it outperforms existing methods in unseen environments, particularly those with complex obstacles and layouts. We also demonstrate the successful transfer of RSPT to real-world settings. Project Website: https://sites.google.com/view/aot-rspt.

Fangwei Zhong, Xiao Bi, Yudi Zhang, Wei Zhang, Yizhou Wang• 2023

Related benchmarks

TaskDatasetResultRank
Visual Active TrackingUnrealCV Parking Lot scene
EL480
21
Embodied Visual TrackingSimpleRoom Unseen Virtual Environment
EL500
16
Embodied Visual TrackingUrbanCity Unseen Virtual Environment
EL500
16
Visual Active TrackingUnrealCV Snow Village scene
EL410
11
Visual Active TrackingUnrealCV
EL478
11
Visual Active TrackingUnrealCV UrbanRoad scene
EL500
11
Showing 6 of 6 rows

Other info

Follow for update