Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Objective Deep Reinforcement Learning

About

We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs, DOL computes the convex coverage set containing all potential optimal solutions of the convex combinations of the objectives. To our knowledge, this is the first time that deep reinforcement learning has succeeded in learning multi-objective policies. In addition, we provide a testbed with two experiments to be used as a benchmark for deep multi-objective reinforcement learning.

Hossam Mossalam, Yannis M. Assael, Diederik M. Roijers, Shimon Whiteson• 2016

Related benchmarks

TaskDatasetResultRank
Multi-objective Reinforcement LearningQueue
MER4.19
11
Multi-objective Reinforcement LearningMaze
Mean Episode Reward (MER)16.15
11
Showing 2 of 2 rows

Other info

Follow for update