Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation

About

This work proposes a retrieve-and-transfer framework for zero-shot robotic manipulation, dubbed RAM, featuring generalizability across various objects, environments, and embodiments. Unlike existing approaches that learn manipulation from expensive in-domain demonstrations, RAM capitalizes on a retrieval-based affordance transfer paradigm to acquire versatile manipulation capabilities from abundant out-of-domain data. First, RAM extracts unified affordance at scale from diverse sources of demonstrations including robotic data, human-object interaction (HOI) data, and custom data to construct a comprehensive affordance memory. Then given a language instruction, RAM hierarchically retrieves the most similar demonstration from the affordance memory and transfers such out-of-domain 2D affordance to in-domain 3D executable affordance in a zero-shot and embodiment-agnostic manner. Extensive simulation and real-world evaluations demonstrate that our RAM consistently outperforms existing works in diverse daily tasks. Additionally, RAM shows significant potential for downstream applications such as automatic and efficient data collection, one-shot visual imitation, and LLM/VLM-integrated long-horizon manipulation. For more details, please check our website at https://yxkryptonite.github.io/RAM/.

Yuxuan Kuang, Junjie Ye, Haoran Geng, Jiageng Mao, Congyue Deng, Leonidas Guibas, He Wang, Yue Wang• 2024

Related benchmarks

TaskDatasetResultRank
Affordance LocalizationAGD20K
SR0.1824
5
Dynamic Affordance PredictionDROID 70/30 (test)
Open Microwave MAE49.82
4
Dynamic Affordance PredictionDROID & HOI4D Combined
Overall MAE62.84
4
Dynamic Affordance PredictionHOI4D 70/30 (test)
Pickup MAE63.07
4
Robot ManipulationReal-world setting
Put Into1
3
Open/Close ManipulationReal-world (unseen objects)
Open Drawer Success Rate70
3
Open/Close ManipulationReal-world Unseen Categories
Open Cabinet Success Rate70
3
Pickup ManipulationReal-world (unseen objects)
Success Rate (Bowl)85
3
Pickup ManipulationReal-world Unseen Categories
Plate Pickup Success Rate80
3
Showing 9 of 9 rows

Other info

Follow for update