Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Asking Easy Questions: A User-Friendly Approach to Active Reward Learning

About

Robots can learn the right reward function by querying a human expert. Existing approaches attempt to choose questions where the robot is most uncertain about the human's response; however, they do not consider how easy it will be for the human to answer! In this paper we explore an information gain formulation for optimally selecting questions that naturally account for the human's ability to answer. Our approach identifies questions that optimize the trade-off between robot and human uncertainty, and determines when these questions become redundant or costly. Simulations and a user study show our method not only produces easy questions, but also ultimately results in faster reward learning.

Erdem B{\i}y{\i}k, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, Dorsa Sadigh• 2019

Related benchmarks

TaskDatasetResultRank
Preference LearningDriving Simulated
Alignment0.948
15
Query GenerationUnit ball representation space
Mean Runtime (ms)418
12
Preference LearningLunar Lander Simulated
Alignment93.3
3
Preference LearningRobot Face Design Simulated
Alignment96
3
Preference LearningRobot Voice Design Simulated
Alignment0.852
3
Robot preference learningUser Study Aggregated Physical and Social HRI Tasks
Behavioral Adaptation Score4.48
3
Showing 6 of 6 rows

Other info

Follow for update