Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

About

Customizing robotic behaviors to be aligned with diverse human preferences is an underexplored challenge in the field of embodied AI. In this paper, we present Promptable Behaviors, a novel framework that facilitates efficient personalization of robotic agents to diverse human preferences in complex environments. We use multi-objective reinforcement learning to train a single policy adaptable to a broad spectrum of preferences. We introduce three distinct methods to infer human preferences by leveraging different types of interactions: (1) human demonstrations, (2) preference feedback on trajectory comparisons, and (3) language instructions. We evaluate the proposed method in personalized object-goal navigation and flee navigation tasks in ProcTHOR and RoboTHOR, demonstrating the ability to prompt agent behaviors to satisfy human preferences in various scenarios. Project page: https://promptable-behaviors.github.io

Minyoung Hwang, Luca Weihs, Chanwoo Park, Kimin Lee, Aniruddha Kembhavi, Kiana Ehsani• 2023

Related benchmarks

Task	Dataset	Result
Preference Profile Estimation	Summary	Misprediction Rate0.124	24
Preference Profile Estimation	Assistant	Mis-prediction Rate5.5	24
Personalized Response Generation	Assistant and Summary personalization tasks (test)	Win Rate52.44	12

Showing 3 of 3 rows

Other info

Code

Follow for update

@wizwand_team Discord