Memory-Augmented Reinforcement Learning for Image-Goal Navigation

About

In this work, we present a memory-augmented approach for image-goal navigation. Earlier attempts, including RL-based and SLAM-based approaches have either shown poor generalization performance, or are heavily-reliant on pose/depth sensors. Our method is based on an attention-based end-to-end model that leverages an episodic memory to learn to navigate. First, we train a state-embedding network in a self-supervised fashion, and then use it to embed previously-visited states into the agent's memory. Our navigation policy takes advantage of this information through an attention mechanism. We validate our approach with extensive evaluations, and show that our model establishes a new state of the art on the challenging Gibson dataset. Furthermore, we achieve this impressive performance from RGB input alone, without access to additional information such as position or depth, in stark contrast to related work.

Lina Mezghani, Sainbayar Sukhbaatar, Thibaut Lavril, Oleksandr Maksymets, Dhruv Batra, Piotr Bojanowski, Karteek Alahari• 2021

Related benchmarks

Task	Dataset	Result
Image-Goal Navigation	MP3D (test)	Success Rate6.9	32
Image-Goal Navigation	Gibson (A)	Success Rate69	22
Image-Goal Navigation	Gibson (test)	Succ (Average)69	17
Image-Goal Navigation	HM3D (test)	Success Rate3.5	16
Image-Goal Navigation	Gibson (Straight-Easy)	Success Rate78	13
Image-Goal Navigation	Gibson Overall	SR69.3	8
Image-Goal Navigation	Gibson Medium	Success Rate (SR)70	6
Image-Goal Navigation	Gibson Hard	Success Rate (SR)60	6

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord