Showing 1–14 of 14 results for author: Levy, K Y

  1. arXiv:2304.04172  [pdf, ps, other]

    cs.LG math.OC

    $μ^2$-SGD: Stable Stochastic Optimization via a Double Momentum Mechanism

    Authors: Kfir Y. Levy

    Abstract: We consider stochastic convex optimization problems where the objective is an expectation over smooth functions. For this setting we suggest a novel gradient estimate that combines two recent mechanisms that are related to the notion of momentum. Then, we design an SGD-style algorithm as well as an accelerated version that make use of this new estimator, and demonstrate the robustness of these new appr…

    Submitted 9 April, 2023; originally announced April 2023.

  2. arXiv:2304.04169  [pdf, ps, other]

    cs.LG math.OC

    SLowcal-SGD: Slow Query Points Improve Local-SGD for Stochastic Convex Optimization

    Authors: Kfir Y. Levy

    Abstract: We consider distributed learning scenarios where M machines interact with a parameter server along several communication rounds in order to minimize a joint objective function. Focusing on the heterogeneous case, where different machines may draw samples from different data-distributions, we design the first local update method that provably benefits over the two most prominent distributed baselin…

    Submitted 9 April, 2023; originally announced April 2023.

  3. arXiv:2206.09352  [pdf, other]

    math.OC

    A universal black-box optimization method with almost dimension-free convergence rate guarantees

    Authors: Kimon Antonakopoulos, Dong Quan Vu, Volkan Cevher, Kfir Y. Levy, Panayotis Mertikopoulos

    Abstract: Universal methods for optimization are designed to achieve theoretically optimal convergence rates without any prior knowledge of the problem's regularity parameters or the accuracy of the gradient oracle employed by the optimizer. In this regard, existing state-of-the-art algorithms achieve an $\mathcal{O}(1/T^2)$ value convergence rate in Lipschitz smooth problems with a perfect gradient oracle…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 31 pages, 4 figures, 1 table; to appear in ICML 2022

    MSC Class: Primary 90C25; 90C15; secondary 68Q32; 68T05

  4. arXiv:2204.02833  [pdf, ps, other]

    math.OC cs.LG

    High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad Stepsize

    Authors: Ali Kavis, Kfir Yehuda Levy, Volkan Cevher

    Abstract: In this paper, we propose a new, simplified high probability analysis of AdaGrad for smooth, non-convex problems. More specifically, we focus on a particular accelerated gradient (AGD) template (Lan, 2020), through which we recover the original AdaGrad and its variant with averaging, and prove a convergence rate of $\mathcal{O}(1/\sqrt{T})$ with high probability without the knowledge of smoothnes…

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 27 pages, accepted to ICLR 2022

  5. arXiv:2111.11309  [pdf, ps, other]

    cs.LG cs.GT math.OC

    No-Regret Dynamics in the Fenchel Game: A Unified Framework for Algorithmic Convex Optimization

    Authors: Jun-Kun Wang, Jacob Abernethy, Kfir Y. Levy

    Abstract: We develop an algorithmic framework for solving convex optimization problems using no-regret game dynamics. By converting the problem of minimizing a convex function into an auxiliary problem of solving a min-max game in a sequential fashion, we can consider a range of strategies for each of the two players who must select their actions one after the other. A common choice for these strategies are…

    Submitted 18 February, 2023; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.12923

  6. arXiv:2111.01040  [pdf, other]

    math.OC cs.LG

    STORM+: Fully Adaptive SGD with Momentum for Nonconvex Optimization

    Authors: Kfir Y. Levy, Ali Kavis, Volkan Cevher

    Abstract: In this work we investigate stochastic non-convex optimization problems where the objective is an expectation over smooth loss functions, and the goal is to find an approximate stationary point. The most popular approach to handling such problems is variance reduction techniques, which are also known to obtain tight convergence rates, matching the lower bounds in this case. Nevertheless, these tec…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 25 pages, 1 figure, accepted to NeurIPS 2021

  7. arXiv:2109.09835  [pdf, ps, other]

    math.OC

    Fast Projection Onto Convex Smooth Constraints

    Authors: Ilnura Usmanova, Maryam Kamgarpour, Andreas Krause, Kfir Yehuda Levy

    Abstract: The Euclidean projection onto a convex set is an important problem that arises in numerous constrained optimization tasks. Unfortunately, in many cases, computing projections is computationally demanding. In this work, we focus on projection problems where the constraints are smooth and the number of constraints is significantly smaller than the dimension. The runtime of existing approaches to sol…

    Submitted 20 September, 2021; originally announced September 2021.

  8. arXiv:1910.13857  [pdf, other]

    math.OC cs.LG

    UniXGrad: A Universal, Adaptive Algorithm with Optimal Guarantees for Constrained Optimization

    Authors: Ali Kavis, Kfir Y. Levy, Francis Bach, Volkan Cevher

    Abstract: We propose a novel adaptive, accelerated algorithm for the stochastic constrained convex optimization setting. Our method, which is inspired by the Mirror-Prox method, \emph{simultaneously} achieves the optimal rates for smooth/non-smooth problems with either deterministic/stochastic first-order oracles. This is done without any prior knowledge of the smoothness nor the noise properties of the pro…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  9. arXiv:1902.01637  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Universal Algorithm for Variational Inequalities Adaptive to Smoothness and Noise

    Authors: Francis Bach, Kfir Y. Levy

    Abstract: We consider variational inequalities coming from monotone operators, a setting that includes convex minimization and convex-concave saddle-point problems. We assume access to potentially noisy unbiased values of the monotone operators and assess convergence through a compatible gap function which corresponds to the standard optimality criteria in the aforementioned subcases. We present a univer…

    Submitted 5 February, 2019; originally announced February 2019.

  10. arXiv:1809.02864  [pdf, other]

    cs.LG math.OC stat.ML

    Online Adaptive Methods, Universality and Acceleration

    Authors: Kfir Y. Levy, Alp Yurtsever, Volkan Cevher

    Abstract: We present a novel method for convex unconstrained optimization that, without any modifications, ensures: (i) accelerated convergence rate for smooth objectives, (ii) standard convergence rate in the general (non-smooth) setting, and (iii) standard convergence rate in the stochastic optimization setting. To the best of our knowledge, this is the first method that simultaneously applies to all of t…

    Submitted 8 September, 2018; originally announced September 2018.

  11. arXiv:1705.10499  [pdf, other]

    cs.LG math.OC stat.ML

    Online to Offline Conversions, Universality and Adaptive Minibatch Sizes

    Authors: Kfir Y. Levy

    Abstract: We present an approach towards convex optimization that relies on a novel scheme which converts online adaptive algorithms into offline methods. In the offline optimization setting, our derived methods are shown to obtain favourable adaptive guarantees which depend on the harmonic sum of the queried gradients. We further show that our methods implicitly adapt to the objective's structure: in the s…

    Submitted 31 May, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  12. arXiv:1611.04831  [pdf, other]

    cs.LG math.OC stat.ML

    The Power of Normalization: Faster Evasion of Saddle Points

    Authors: Kfir Y. Levy

    Abstract: A commonly used heuristic in non-convex optimization is Normalized Gradient Descent (NGD) - a variant of gradient descent in which only the direction of the gradient is taken into account and its magnitude ignored. We analyze this heuristic and show that with carefully chosen parameters and noise injection, this method can provably evade saddle points. We establish the convergence of NGD to a loca…

    Submitted 15 November, 2016; originally announced November 2016.

  13. arXiv:1507.02030  [pdf, other]

    cs.LG math.OC

    Beyond Convexity: Stochastic Quasi-Convex Optimization

    Authors: Elad Hazan, Kfir Y. Levy, Shai Shalev-Shwartz

    Abstract: Stochastic convex optimization is a basic and well-studied primitive in machine learning. It is well known that convex and Lipschitz functions can be minimized efficiently using Stochastic Gradient Descent (SGD). The Normalized Gradient Descent (NGD) algorithm is an adaptation of Gradient Descent, which updates according to the direction of the gradients, rather than the gradients themselves. In…

    Submitted 28 October, 2015; v1 submitted 8 July, 2015; originally announced July 2015.

  14. arXiv:1503.03712  [pdf, other]

    cs.LG math.OC

    On Graduated Optimization for Stochastic Non-Convex Problems

    Authors: Elad Hazan, Kfir Y. Levy, Shai Shalev-Shwartz

    Abstract: The graduated optimization approach, also known as the continuation method, is a popular heuristic for solving non-convex problems that has received renewed interest over the last decade. Despite its popularity, very little is known in terms of theoretical convergence analysis. In this paper we describe a new first-order algorithm based on graduated optimization and analyze its performance. We ch…

    Submitted 8 July, 2015; v1 submitted 12 March, 2015; originally announced March 2015.

    Comments: 17 pages

    MSC Class: 68