Asking Easy Questions: A User-Friendly Approach to Active Reward Learning

About

Robots can learn the right reward function by querying a human expert. Existing approaches attempt to choose questions where the robot is most uncertain about the human's response; however, they do not consider how easy it will be for the human to answer! In this paper we explore an information gain formulation for optimally selecting questions that naturally account for the human's ability to answer. Our approach identifies questions that optimize the trade-off between robot and human uncertainty, and determines when these questions become redundant or costly. Simulations and a user study show our method not only produces easy questions, but also ultimately results in faster reward learning.
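The abstract does not spell out the objective. One common way to write an information gain criterion of this kind is as the mutual information between the human's answer and the reward weights: the entropy of the averaged answer distribution (how unsure the robot is about the response) minus the average entropy of the answer under each candidate reward (how unsure the human would be). The sketch below illustrates that reading for two-option queries. The linear reward, the softmax choice model, the sampled weights, and all function names are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def response_probs(weights, features_a, features_b):
    """P(human picks A or B | w) under an assumed softmax choice model.

    weights: (M, d) samples of candidate reward weights.
    features_a, features_b: (d,) feature vectors of the two options.
    Returns an (M, 2) array of choice probabilities.
    """
    features_a = np.asarray(features_a, dtype=float)
    features_b = np.asarray(features_b, dtype=float)
    rewards = np.stack([weights @ features_a, weights @ features_b], axis=1)
    rewards = rewards - rewards.max(axis=1, keepdims=True)  # numerical stability
    exp = np.exp(rewards)
    return exp / exp.sum(axis=1, keepdims=True)

def entropy(p, axis=-1):
    """Shannon entropy in bits, with clipping to avoid log(0)."""
    p = np.clip(p, 1e-12, 1.0)
    return -(p * np.log2(p)).sum(axis=axis)

def information_gain(weights, features_a, features_b):
    """Sample-based estimate of the mutual information between answer and weights."""
    probs = response_probs(weights, features_a, features_b)
    robot_uncertainty = entropy(probs.mean(axis=0))  # entropy of the averaged answer
    human_uncertainty = entropy(probs).mean()        # average entropy per candidate reward
    return robot_uncertainty - human_uncertainty

# Hypothetical usage: score a few candidate queries and pick the best one.
rng = np.random.default_rng(0)
weights = rng.normal(size=(500, 3))
weights /= np.linalg.norm(weights, axis=1, keepdims=True)  # unit-norm weight samples

candidate_queries = [
    (np.array([1.0, 0.0, 0.0]), np.array([1.0, 0.0, 0.0])),   # identical options
    (np.array([1.0, 0.0, 0.0]), np.array([-1.0, 0.0, 0.0])),  # clearly different options
]
gains = [information_gain(weights, a, b) for a, b in candidate_queries]
best_query_index = int(np.argmax(gains))
```

Under these assumptions, a query whose options are indistinguishable scores zero: the robot cannot predict the answer, but neither can the human, so the two terms cancel. This is the sense in which such an objective avoids questions that are hard to answer.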

Erdem Bıyık, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, Dorsa Sadigh • 2019

Related benchmarks

Task	Dataset	Result
Preference Learning	Driving Simulated	Alignment 0.948	15
Query Generation	Unit ball representation space	Mean Runtime (ms) 418	12
Preference Learning	Lunar Lander Simulated	Alignment 93.3	3
Preference Learning	Robot Face Design Simulated	Alignment 96	3
Preference Learning	Robot Voice Design Simulated	Alignment 0.852	3
Robot preference learning	User Study Aggregated Physical and Social HRI Tasks	Behavioral Adaptation Score 4.48	3
