Scalable Bayesian Optimization Using Deep Neural Networks

About

Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for this distribution over functions is critical to the effectiveness of the approach, and is typically fit using Gaussian processes (GPs). However, since GPs scale cubically with the number of observations, it has been challenging to handle objectives whose optimization requires many evaluations, and as such, massively parallelizing the optimization. In this work, we explore the use of neural networks as an alternative to GPs to model distributions over functions. We show that performing adaptive basis function regression with a neural network as the parametric form performs competitively with state-of-the-art GP-based approaches, but scales linearly with the number of data rather than cubically. This allows us to achieve a previously intractable degree of parallelism, which we apply to large scale hyperparameter optimization, rapidly finding competitive models on benchmark object recognition tasks using convolutional networks, and image caption generation using neural language models.

Jasper Snoek, Oren Rippel, Kevin Swersky, Ryan Kiros, Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Prabhat, Ryan P. Adams• 2015

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	--	3518
Image Classification	CIFAR-10 (test)	--	3381
Classification	SVHN (test)	Error Rate1.77	182
Neural Architecture Search	NAS-Bench-201 ImageNet-16-120 (test)	Accuracy46.4	140
Neural Architecture Search	CIFAR-10 NAS-Bench-201 (val)	Accuracy94.05	111
Neural Architecture Search	NAS-Bench-201 CIFAR-10 (test)	Accuracy94.22	110
Neural Architecture Search	ImageNet16-120 NAS-Bench-201 (val)	Accuracy46.43	104
Neural Architecture Search	NAS-Bench-201 CIFAR-100 (test)	Accuracy73.22	103
Neural Architecture Search	CIFAR-100 NAS-Bench-201 (val)	Accuracy73.26	92
Image Classification	CIFAR-10 Standard data augmentation (test)	Test Error Rate6.37	43

Showing 10 of 39 rows

Other info

Code

Follow for update

@wizwand_team Discord