Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning

About

We present a novel procedure for optimization based on the combination of efficient quantized tensor train representation and a generalized maximum matrix volume principle. We demonstrate the applicability of the new Tensor Train Optimizer (TTOpt) method for various tasks, ranging from minimization of multidimensional functions to reinforcement learning. Our algorithm compares favorably to popular evolutionary-based methods and outperforms them by the number of function evaluations or execution time, often by a significant margin.

Konstantin Sozykin, Andrei Chertkov, Roman Schutski, Anh-Huy Phan, Andrzej Cichocki, Ivan Oseledets• 2022

Related benchmarks

TaskDatasetResultRank
Reinforcement LearningSwimmer v3
Mean Reward357.5
15
Reinforcement LearningHalfCheetah v3
Mean Reward4.21e+3
15
Global OptimizationF1
Final Error3.90e-6
14
Global OptimizationF2 benchmark function
Final Error2.90e-7
14
Global OptimizationF6 benchmark function
F6 Final Error0.11
14
Global OptimizationF8 benchmark function
Final Error (ε)4.60e-11
14
Global OptimizationF10 benchmark function
Final Error1.30e-4
14
Global OptimizationF9 benchmark function
Final Error0.18
14
Global OptimizationF4
Final Error4.40e-15
14
Global OptimizationF5 benchmark function
Final Error0.028
14
Showing 10 of 38 rows

Other info

Code

Follow for update