Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

About

Performing language-conditioned robotic manipulation tasks in unstructured environments is highly demanded for general intelligent robots. Conventional robotic manipulation methods usually learn semantic representation of the observation for action prediction, which ignores the scene-level spatiotemporal dynamics for human goal completion. In this paper, we propose a dynamic Gaussian Splatting method named ManiGaussian for multi-task robotic manipulation, which mines scene dynamics via future scene reconstruction. Specifically, we first formulate the dynamic Gaussian Splatting framework that infers the semantics propagation in the Gaussian embedding space, where the semantic representation is leveraged to predict the optimal robot action. Then, we build a Gaussian world model to parameterize the distribution in our dynamic Gaussian Splatting framework, which provides informative supervision in the interactive environment via future scene reconstruction. We evaluate our ManiGaussian on 10 RLBench tasks with 166 variations, and the results demonstrate our framework can outperform the state-of-the-art methods by 13.1\% in average success rate. Project page: https://guanxinglu.github.io/ManiGaussian/.

Guanxing Lu, Shiyi Zhang, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang• 2024

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationRLBench (test)
Average Success Rate31.5
34
Multi-task Robotic ManipulationRLBench
Avg Success Rate44.8
16
Robot ManipulationRLBench Moderate Shift
Average Success Rate30.7
11
Robot ManipulationRLBench Large Shift
Rel. Drop (Avg)10.7
10
Robot ManipulationRLBench Large Shift (test)
Average SR25.8
8
Robotic ManipulationRLBench Multi-task Train View
Relative Performance Drop0.4
3
Robotic ManipulationRLBench Multi-task Moderate Shift
Relative Performance Drop3.8
3
Robot ManipulationRLBench View (train)
SR (Avg)47.1
3
Showing 8 of 8 rows

Other info

Follow for update