计算机科学
贝叶斯优化
超参数
人工智能
高斯过程
人工神经网络
卷积神经网络
机器学习
深度学习
全球定位系统
水准点(测量)
最优化问题
替代模型
可扩展性
比例(比率)
高斯分布
算法
电信
物理
大地测量学
量子力学
数据库
地理
作者
Jasper Snoek,Oren Rippel,Kevin Swersky,Ryan Kiros,Nadathur Satish,Narayanan Sundaram,Mostofa Patwary,. Prabhat,Ryan P. Adams
出处
期刊:Cornell University - arXiv
日期:2015-01-01
被引量:348
标识
DOI:10.48550/arxiv.1502.05700
摘要
Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for this distribution over functions is critical to the effectiveness of the approach, and is typically fit using Gaussian processes (GPs). However, since GPs scale cubically with the number of observations, it has been challenging to handle objectives whose optimization requires many evaluations, and as such, massively parallelizing the optimization. In this work, we explore the use of neural networks as an alternative to GPs to model distributions over functions. We show that performing adaptive basis function regression with a neural network as the parametric form performs competitively with state-of-the-art GP-based approaches, but scales linearly with the number of data rather than cubically. This allows us to achieve a previously intractable degree of parallelism, which we apply to large scale hyperparameter optimization, rapidly finding competitive models on benchmark object recognition tasks using convolutional networks, and image caption generation using neural language models.
科研通智能强力驱动
Strongly Powered by AbleSci AI