Meta-Learning for Black-box Optimization

About

Recently, neural networks trained as optimizers under the "learning to learn" or meta-learning framework have been shown to be effective for a broad range of optimization tasks including derivative-free black-box function optimization. Recurrent neural networks (RNNs) trained to optimize a diverse set of synthetic non-convex differentiable functions via gradient descent have been effective at optimizing derivative-free black-box functions. In this work, we propose RNN-Opt: an approach for learning RNN-based optimizers for optimizing real-parameter single-objective continuous functions under limited budget constraints. Existing approaches utilize an observed improvement based meta-learning loss function for training such models. We propose training RNN-Opt by using synthetic non-convex functions with known (approximate) optimal values by directly using discounted regret as our meta-learning loss function. We hypothesize that a regret-based loss function mimics typical testing scenarios, and would therefore lead to better optimizers compared to optimizers trained only to propose queries that improve over previous queries. Further, RNN-Opt incorporates simple yet effective enhancements during training and inference procedures to deal with the following practical challenges: i) Unknown range of possible values for the black-box function to be optimized, and ii) Practical and domain-knowledge based constraints on the input parameters. We demonstrate the efficacy of RNN-Opt in comparison to existing methods on several synthetic as well as standard benchmark black-box functions along with an anonymized industrial constrained optimization problem.

Vishnu TV, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff• 2019

Related benchmarks

Task	Dataset	Result
Path planning	UAV Benchmark 40 terrain scenarios S.I	Terrain 1 Cost5.69e+4	14
Black-box Optimization	BBOB-30D	Buche_Ras1.60e+4	12
Black-box Optimization	BBOB 10D	BucheRastrigin6.62e+3	12
High-dimensional Numerical Optimization	LSGO-1000D	Shifted Elliptic2.10e+11	11
Black-box Optimization	BBOB surrogate 10-dimensional (out-of-distribution)	Rastrigin Function Value330.4	7
UAV Path Planning	UAV Benchmark 56 distinct terrain scenarios (Last 16 terrains (41-56))	Path Cost2.59e+4	7

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord