Showing 1–3 of 3 results for author: Shcherbatyi, I

  1. arXiv:2012.08489  [pdf, other]

    cs.LG cs.AI stat.ML

    Amazon SageMaker Automatic Model Tuning: Scalable Gradient-Free Optimization

    Authors: Valerio Perrone, Huibin Shen, Aida Zolic, Iaroslav Shcherbatyi, Amr Ahmed, Tanya Bansal, Michele Donini, Fela Winkelmolen, Rodolphe Jenatton, Jean Baptiste Faddoul, Barbara Pogorzelska, Miroslav Miladinovic, Krishnaram Kenthapadi, Matthias Seeger, Cédric Archambeau

    Abstract: Tuning complex machine learning systems is challenging. Machine learning typically requires to set hyperparameters, be it regularization, architecture, or optimization parameters, whose tuning is critical to achieve good predictive performance. To democratize access to machine learning systems, it is essential to automate the tuning. This paper presents Amazon SageMaker Automatic Model Tuning (AMT…

    Submitted 18 June, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  2. arXiv:1910.07003  [pdf, other]

    stat.ML cs.LG

    Constrained Bayesian Optimization with Max-Value Entropy Search

    Authors: Valerio Perrone, Iaroslav Shcherbatyi, Rodolphe Jenatton, Cedric Archambeau, Matthias Seeger

    Abstract: Bayesian optimization (BO) is a model-based approach to sequentially optimize expensive black-box functions, such as the validation error of a deep neural network with respect to its hyperparameters. In many real-world scenarios, the optimization is further subject to a priori unknown constraints. For example, training a deep network configuration may fail with an out-of-memory error when the mode…

    Submitted 15 October, 2019; originally announced October 2019.

  3. arXiv:1602.06746  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Convexification of Learning from Constraints

    Authors: Iaroslav Shcherbatyi, Bjoern Andres

    Abstract: Regularized empirical risk minimization with constrained labels (in contrast to fixed labels) is a remarkably general abstraction of learning. For common loss and regularization functions, this optimization problem assumes the form of a mixed integer program (MIP) whose objective function is non-convex. In this form, the problem is resistant to standard optimization techniques. We construct MIPs w…

    Submitted 22 February, 2016; originally announced February 2016.