Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains
About
Robust humanoid locomotion requires accurate and globally consistent perception of the surrounding 3D environment. However, existing perception modules, mainly based on depth images or elevation maps, offer only partial and locally flattened views of the environment, failing to capture the full 3D structure. This paper presents Gallant, a voxel-grid-based framework for humanoid locomotion and local navigation in 3D constrained terrains. It leverages voxelized LiDAR data as a lightweight and structured perceptual representation, and employs a z-grouped 2D CNN to map this representation to the control policy, enabling fully end-to-end optimization. A high-fidelity LiDAR simulation that dynamically generates realistic observations is developed to support scalable, LiDAR-based training and ensure sim-to-real consistency. Experimental results show that Gallant's broader perceptual coverage facilitates the use of a single policy that goes beyond the limitations of previous methods confined to ground-level obstacles, extending to lateral clutter, overhead constraints, multi-level structures, and narrow passages. Gallant also firstly achieves near 100% success rates in challenging scenarios such as stair climbing and stepping onto elevated platforms through improved end-to-end optimization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Locomotion | Stairs (Simulated) | Traversing Rate76.3 | 11 | |
| Humanoid Locomotion | Simulation Plane | Success Rate (Rsucc)100 | 10 | |
| Humanoid Locomotion | Simulation Pole | Success Rate46 | 5 | |
| Humanoid Locomotion | Simulation Narrow Gate | Success Rate (Rsucc)66 | 5 | |
| Humanoid Locomotion | Simulation Jump | Success Rate (Rsucc)68 | 5 | |
| Humanoid Locomotion | Simulation Hurdle | Success Rate (Rsucc)54.67 | 5 | |
| Humanoid Locomotion | Simulation Beam | Success Rate (Rsucc)32 | 5 | |
| Training Efficiency | Simulation | Train Iteration Time (s)34.6 | 4 | |
| Humanoid Local Navigation | Flat Terrain Static Obstacle Environment | Success Rate85.45 | 4 | |
| Humanoid Local Navigation | Unstructured Terrain Static Obstacle Environment | Success Rate60.14 | 4 |