Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Self-supervised Reinforcement Learning with Independently Controllable Subgoals

About

To successfully tackle challenging manipulation tasks, autonomous agents must learn a diverse set of skills and how to combine them. Recently, self-supervised agents that set their own abstract goals by exploiting the discovered structure in the environment were shown to perform well on many different tasks. In particular, some of them were applied to learn basic manipulation skills in compositional multi-object environments. However, these methods learn skills without taking the dependencies between objects into account. Thus, the learned skills are difficult to combine in realistic environments. We propose a novel self-supervised agent that estimates relations between environment components and uses them to independently control different parts of the environment state. In addition, the estimated relations between objects can be used to decompose a complex goal into a compatible sequence of subgoals. We show that, by using this framework, an agent can efficiently and automatically learn manipulation tasks in multi-object environments with different relations between objects.

Andrii Zadaianchuk, Georg Martius, Fanny Yang• 2021

Related benchmarks

TaskDatasetResultRank
Push and SwitchOpenAI Fetch - Push and Switch 3-Push + 3-Switch (S+O) (test)
Success Rate68.5
18
2-SwitchPush & Switch
Success Rate97.8
9
SwitchOpenAI Fetch Switch 2-Switch (L) (test)
Success Rate91.9
9
2-PushPush & Switch
Success Rate98.1
9
3-PushPush & Switch
Success Rate93.1
9
Object InteractionSpriteworld (train)
Success Rate82.4
9
Object InteractionSpriteworld unseen object numbers (test)
Avg Success Rate80.2
9
Push and SwitchOpenAI Fetch - Push and Switch 2-Push + 2-Switch (L+S) (test)
Success Rate54.9
9
Object ComparisonSpriteworld (train)
Success Rate81.2
9
Object ComparisonSpriteworld
Success Rate73.5
9
Showing 10 of 27 rows

Other info

Follow for update