Self-supervised Reinforcement Learning with Independently Controllable Subgoals

About

To successfully tackle challenging manipulation tasks, autonomous agents must learn a diverse set of skills and how to combine them. Recently, self-supervised agents that set their own abstract goals by exploiting the discovered structure in the environment were shown to perform well on many different tasks. In particular, some of them were applied to learn basic manipulation skills in compositional multi-object environments. However, these methods learn skills without taking the dependencies between objects into account. Thus, the learned skills are difficult to combine in realistic environments. We propose a novel self-supervised agent that estimates relations between environment components and uses them to independently control different parts of the environment state. In addition, the estimated relations between objects can be used to decompose a complex goal into a compatible sequence of subgoals. We show that, by using this framework, an agent can efficiently and automatically learn manipulation tasks in multi-object environments with different relations between objects.

Andrii Zadaianchuk, Georg Martius, Fanny Yang• 2021

Related benchmarks

Task	Dataset	Result
Push and Switch	OpenAI Fetch - Push and Switch 3-Push + 3-Switch (S+O) (test)	Success Rate68.5	18
2-Switch	Push & Switch	Success Rate97.8	9
Switch	OpenAI Fetch Switch 2-Switch (L) (test)	Success Rate91.9	9
2-Push	Push & Switch	Success Rate98.1	9
3-Push	Push & Switch	Success Rate93.1	9
Object Interaction	Spriteworld (train)	Success Rate82.4	9
Object Interaction	Spriteworld unseen object numbers (test)	Avg Success Rate80.2	9
Push and Switch	OpenAI Fetch - Push and Switch 2-Push + 2-Switch (L+S) (test)	Success Rate54.9	9
Object Comparison	Spriteworld (train)	Success Rate81.2	9
Object Comparison	Spriteworld	Success Rate73.5	9

Showing 10 of 27 rows

Other info

Follow for update

@wizwand_team Discord