FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control

About

Reinforcement learning (RL) has driven significant progress in robotics, but its complexity and long training times remain major bottlenecks. In this report, we introduce FastTD3, a simple, fast, and capable RL algorithm that significantly speeds up training for humanoid robots in popular suites such as HumanoidBench, IsaacLab, and MuJoCo Playground. Our recipe is remarkably simple: we train an off-policy TD3 agent with several modifications -- parallel simulation, large-batch updates, a distributional critic, and carefully tuned hyperparameters. FastTD3 solves a range of HumanoidBench tasks in under 3 hours on a single A100 GPU, while remaining stable during training. We also provide a lightweight and easy-to-use implementation of FastTD3 to accelerate RL research in robotics.

Younggyo Seo, Carmelo Sferrazza, Haoran Geng, Michal Nauman, Zhao-Heng Yin, Pieter Abbeel• 2025

Related benchmarks

Task	Dataset	Result
Place Strawberry In Bowl	ANYTASK	Success Rate1.5	7
Stack Banana on Can	ANYTASK	Success Rate0.4	7
Lift Banana	ANYTASK	Success Rate0.00e+0	7
Lift Brick	ANYTASK	Success Rate0.00e+0	7
Lift Peach	ANYTASK	Success Rate0.00e+0	7
open drawer	ANYTASK	Success Rate0.00e+0	7
Push Pear to Center	ANYTASK	Success Rate0.00e+0	7
Put Object In Closed Drawer	ANYTASK	Success Rate0.00e+0	7

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord