Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Variational Inference MPC using Tsallis Divergence

About

In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using thenon-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function, a novel Tsallis Variational Inference-Model Predictive Control algorithm is derived, which includes prior works such as Variational Inference-Model Predictive Control, Model Predictive PathIntegral Control, Cross Entropy Method, and Stein VariationalInference Model Predictive Control as special cases. The proposed algorithm allows for effective control of the cost/reward transform and is characterized by superior performance in terms of mean and variance reduction of the associated cost. The aforementioned features are supported by a theoretical and numerical analysis on the level of risk sensitivity of the proposed algorithm as well as simulation experiments on 5 different robotic systems with 3 different policy parameterizations.

Ziyi Wang, Oswin So, Jason Gibson, Bogdan Vlahov, Manan S. Gandhi, Guan-Horng Liu, Evangelos A. Theodorou• 2021

Related benchmarks

TaskDatasetResultRank
Trajectory Optimization8x8m Open Environment Goal (6, 0)
Success Rate100
4
Trajectory Optimization8x8m Cluttered Environment Goal (6, 0)
Success Rate (SR)100
4
Trajectory Optimization8x8m Open Environment Goal (6, -4)
Success Rate (SR)100
4
Trajectory Optimization8x8m Cluttered Environment Goal (6, -4)
Success Rate (SR)100
4
Point-to-point NavigationUnknown Cluttered Environments
Success Rate60
4
Showing 5 of 5 rows

Other info

Follow for update