Skip to main content

Showing 1–9 of 9 results for author: Hoffman, M W

Searching in archive stat. Search in all archives.
.
  1. arXiv:1909.05557  [pdf, other

    cs.LG cs.AI stat.ML

    Modular Meta-Learning with Shrinkage

    Authors: Yutian Chen, Abram L. Friesen, Feryal Behbahani, Arnaud Doucet, David Budden, Matthew W. Hoffman, Nando de Freitas

    Abstract: Many real-world problems, including multi-speaker text-to-speech synthesis, can greatly benefit from the ability to meta-learn large models with only a few task-specific components. Updating only these task-specific modules then allows the model to be adapted to low-data tasks for as many steps as necessary without risking overfitting. Unfortunately, existing meta-learning methods either do not sc… ▽ More

    Submitted 22 October, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Accepted by NeurIPS 2020

  2. arXiv:1804.08617  [pdf, other

    cs.LG cs.AI stat.ML

    Distributed Distributional Deterministic Policy Gradients

    Authors: Gabriel Barth-Maron, Matthew W. Hoffman, David Budden, Will Dabney, Dan Horgan, Dhruva TB, Alistair Muldal, Nicolas Heess, Timothy Lillicrap

    Abstract: This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we call the Distributed Distributional Deep Deterministic Policy Gradient algorithm, D4PG. We also combine this technique with a number of additional, simple improve… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

  3. arXiv:1703.04813  [pdf, other

    cs.LG cs.NE stat.ML

    Learned Optimizers that Scale and Generalize

    Authors: Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Nando de Freitas, Jascha Sohl-Dickstein

    Abstract: Learning to learn has emerged as an important direction for achieving artificial intelligence. Two of the primary barriers to its adoption are an inability to scale to larger problems and a limited ability to generalize to new tasks. We introduce a learned gradient descent optimizer that generalizes well to new tasks, and which has significantly reduced memory and computation overhead. We achieve… ▽ More

    Submitted 7 September, 2017; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: Final ICML paper after reviewer suggestions

  4. arXiv:1611.03824  [pdf, other

    stat.ML cs.LG

    Learning to Learn without Gradient Descent by Gradient Descent

    Authors: Yutian Chen, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Timothy P. Lillicrap, Matt Botvinick, Nando de Freitas

    Abstract: We learn recurrent neural network optimizers trained on simple synthetic functions by gradient descent. We show that these learned optimizers exhibit a remarkable degree of transfer in that they can be used to efficiently optimize a broad range of derivative-free black-box functions, including Gaussian process bandits, simple control objectives, global optimization benchmarks and hyper-parameter t… ▽ More

    Submitted 12 June, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: Accepted by ICML 2017. Previous version "Learning to Learn for Global Optimization of Black Box Functions" was published in the Deep Reinforcement Learning Workshop, NIPS 2016

  5. arXiv:1511.09422  [pdf, other

    stat.ML

    A General Framework for Constrained Bayesian Optimization using Information-based Search

    Authors: José Miguel Hernández-Lobato, Michael A. Gelbart, Ryan P. Adams, Matthew W. Hoffman, Zoubin Ghahramani

    Abstract: We present an information-theoretic framework for solving global black-box optimization problems that also have black-box constraints. Of particular interest to us is to efficiently solve problems with decoupled constraints, in which subsets of the objective and constraint functions may be evaluated independently. For example, when the objective is evaluated on a CPU and the constraints are evalua… ▽ More

    Submitted 4 September, 2016; v1 submitted 30 November, 2015; originally announced November 2015.

  6. arXiv:1502.05312  [pdf, other

    stat.ML

    Predictive Entropy Search for Bayesian Optimization with Unknown Constraints

    Authors: José Miguel Hernández-Lobato, Michael A. Gelbart, Matthew W. Hoffman, Ryan P. Adams, Zoubin Ghahramani

    Abstract: Unknown constraints arise in many types of expensive black-box optimization problems. Several methods have been proposed recently for performing Bayesian optimization with constraints, based on the expected improvement (EI) heuristic. However, EI can lead to pathologies when used with constraints. For example, in the case of decoupled constraints---i.e., when one can independently evaluate the obj… ▽ More

    Submitted 15 July, 2015; v1 submitted 18 February, 2015; originally announced February 2015.

  7. arXiv:1406.4625  [pdf, other

    stat.ML cs.LG

    An Entropy Search Portfolio for Bayesian Optimization

    Authors: Bobak Shahriari, Ziyu Wang, Matthew W. Hoffman, Alexandre Bouchard-Côté, Nando de Freitas

    Abstract: Bayesian optimization is a sample-efficient method for black-box global optimization. How- ever, the performance of a Bayesian optimization method very much depends on its exploration strategy, i.e. the choice of acquisition function, and it is not clear a priori which choice will result in superior performance. While portfolio methods provide an effective, principled way of combining a collection… ▽ More

    Submitted 4 March, 2015; v1 submitted 18 June, 2014; originally announced June 2014.

    Comments: 10 pages, 5 figures

  8. arXiv:1406.2541  [pdf, other

    stat.ML cs.LG

    Predictive Entropy Search for Efficient Global Optimization of Black-box Functions

    Authors: José Miguel Hernández-Lobato, Matthew W. Hoffman, Zoubin Ghahramani

    Abstract: We propose a novel information-theoretic approach for Bayesian optimization called Predictive Entropy Search (PES). At each iteration, PES selects the next evaluation point that maximizes the expected information gained with respect to the global maximum. PES codifies this intractable acquisition function in terms of the expected reduction in the differential entropy of the predictive distribution… ▽ More

    Submitted 10 June, 2014; originally announced June 2014.

  9. arXiv:1303.6746  [pdf, other

    stat.ML cs.LG

    Exploiting correlation and budget constraints in Bayesian multi-armed bandit optimization

    Authors: Matthew W. Hoffman, Bobak Shahriari, Nando de Freitas

    Abstract: We address the problem of finding the maximizer of a nonlinear smooth function, that can only be evaluated point-wise, subject to constraints on the number of permitted function evaluations. This problem is also known as fixed-budget best arm identification in the multi-armed bandit literature. We introduce a Bayesian approach for this problem and show that it empirically outperforms both the exis… ▽ More

    Submitted 11 November, 2013; v1 submitted 27 March, 2013; originally announced March 2013.