Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning

About

Learning is an inherently continuous phenomenon. When humans learn a new task there is no explicit distinction between training and inference. As we learn a task, we keep learning about it while performing the task. What we learn and how we learn it varies during different stages of learning. Learning how to learn and adapt is a key property that enables us to generalize effortlessly to new settings. This is in contrast with conventional settings in machine learning where a trained model is frozen during inference. In this paper we study the problem of learning to learn at both training and test time in the context of visual navigation. A fundamental challenge in navigation is generalization to unseen scenes. In this paper we propose a self-adaptive visual navigation method (SAVN) which learns to adapt to new environments without any explicit supervision. Our solution is a meta-reinforcement learning approach where an agent learns a self-supervised interaction loss that encourages effective navigation. Our experiments, performed in the AI2-THOR framework, show major improvements in both success rate and SPL for visual navigation in novel scenes. Our code and data are available at: https://github.com/allenai/savn .

Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi• 2018

Related benchmarks

TaskDatasetResultRank
Visual NavigationAI2-THOR Unseen Scenes (L >= 5) (test)
SPL13.91
11
Visual NavigationAI2-THOR L ≥ 5 (val)
SPL12.27
8
Visual NavigationAI2-THOR All (val)
SPL0.1386
8
Visual NavigationAI2-THOR Unseen Scenes (All) (test)
SPL16.15
7
ObjectNaviTHOR Seen class (18/4)
Success Rate (SR)76.7
5
ObjectNaviTHOR Unseen class (18/4)
SR81.5
5
ObjectNaviTHOR Seen class (14/8)
Success Rate73.3
5
ObjectNaviTHOR Unseen class (14/8)
Success Rate70.8
5
Visual NavigationAI2-THOR (test)
SPL16.15
4
Showing 9 of 9 rows

Other info

Code

Follow for update