Fully Distributed Bayesian Optimization with Stochastic Policies

Fully Distributed Bayesian Optimization with Stochastic Policies

Javier Garcia-Barcos, Ruben Martinez-Cantin

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Main track. Pages 2357-2363. https://doi.org/10.24963/ijcai.2019/327

Bayesian optimization has become a popular method for applications, like the design of computer experiments or hyperparameter tuning of expensive models, where sample efficiency is mandatory. These situations or high-throughput computing, where distributed and scalable architectures are a necessity. However, Bayesian optimization is mostly sequential. Even parallel variants require certain computations between samples, limiting the parallelization bandwidth. Thompson sampling has been previously applied for distributed Bayesian optimization. But, when compared with other acquisition functions in the sequential setting, Thompson sampling is known to perform suboptimally. In this paper, we present a new method for fully distributed Bayesian optimization, which can be combined with any acquisition function. Our approach considers Bayesian optimization as a partially observable Markov decision process. In this context, stochastic policies, such as the Boltzmann policy, have some interesting properties which can also be studied for Bayesian optimization. Furthermore, the Boltzmann policy trivially allows a distributed Bayesian optimization implementation with high level of parallelism and scalability. We present results in several benchmarks and applications that shows the performance of our method.
Keywords:
Machine Learning: Active Learning
Planning and Scheduling: POMDPs