
Showing 1–21 of 21 results for author: Sabach, S

  1. arXiv:2406.01838  [pdf, other]

    cs.LG cs.AI

    Learning the Target Network in Function Space

    Authors: Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor

    Abstract: We focus on the task of learning the value function in the reinforcement learning (RL) setting. This task is often solved by updating a pair of online and target networks while ensuring that the parameters of these two networks are equivalent. We propose Lookahead-Replicate (LR), a new value-function approximation algorithm that is agnostic to this parameter-space equivalence. Instead, the LR algo…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML24)

  2. arXiv:2401.08893  [pdf, other]

    cs.LG math.OC

    MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

    Authors: Kaan Ozkara, Can Karakus, Parameswaran Raman, Mingyi Hong, Shoham Sabach, Branislav Kveton, Volkan Cevher

    Abstract: Following the introduction of Adam, several novel adaptive optimizers for deep learning have been proposed. These optimizers typically excel in some tasks but may not outperform Adam uniformly across all tasks. In this work, we introduce Meta-Adaptive Optimizers (MADA), a unified optimizer framework that can generalize several known optimizers and dynamically learn the most suitable one during tra…

    Submitted 17 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2401.03058  [pdf, other]

    math.OC cs.LG stat.ML

    Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

    Authors: Ruichen Jiang, Parameswaran Raman, Shoham Sabach, Aryan Mokhtari, Mingyi Hong, Volkan Cevher

    Abstract: Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computational costs. One promising approach is to execute second-order updates within a lower-dimensional subspace, giving rise to subspace second-order methods.…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 27 pages, 2 figures

  4. arXiv:2310.05905  [pdf, other]

    cs.LG cs.AI cs.RO

    TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

    Authors: Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor

    Abstract: The full potential of large pretrained models remains largely untapped in control domains like robotics. This is mainly because of the scarcity of data and the computational challenges associated with training or fine-tuning these large models for such applications. Prior work mainly emphasizes either effective pretraining of large models for decision-making or single-task adaptation. But real-wor…

    Submitted 8 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  5. arXiv:2307.08245  [pdf, other]

    math.OC cs.LG

    Convex Bi-Level Optimization Problems with Non-smooth Outer Objective Function

    Authors: Roey Merchav, Shoham Sabach

    Abstract: In this paper, we propose the Bi-Sub-Gradient (Bi-SG) method, which is a generalization of the classical sub-gradient method to the setting of convex bi-level optimization problems. This is a first-order method that is very easy to implement in the sense that it requires only a computation of the associated proximal mapping or a sub-gradient of the outer non-smooth objective function, in addition…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in SIAM Journal on Optimization

    MSC Class: 65K10; 90C05; 90C25; 90C30; 90C52

  6. arXiv:2306.17833  [pdf, other]

    cs.LG cs.AI

    Resetting the Optimizer in Deep RL: An Empirical Study

    Authors: Kavosh Asadi, Rasool Fakoor, Shoham Sabach

    Abstract: We focus on the task of approximating the optimal value function in deep reinforcement learning. This iterative process is comprised of solving a sequence of optimization problems where the loss function changes per iteration. The common approach to solving this sequence of problems is to employ modern variants of the stochastic gradient descent algorithm such as Adam. These optimizers maintain th…

    Submitted 14 November, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted at Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. arXiv:2306.17750  [pdf, other]

    cs.LG

    TD Convergence: An Optimization Perspective

    Authors: Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor

    Abstract: We study the convergence behavior of the celebrated temporal-difference (TD) learning algorithm. By looking at the algorithm through the lens of optimization, we first argue that TD can be viewed as an iterative optimization algorithm where the function to be minimized changes per iteration. By carefully investigating the divergence displayed by TD on a classical counter example, we identify two f…

    Submitted 8 November, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted at Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  8. arXiv:2210.13968  [pdf, other]

    math.OC cs.LG

    Faster Projection-Free Augmented Lagrangian Methods via Weak Proximal Oracle

    Authors: Dan Garber, Tsur Livney, Shoham Sabach

    Abstract: This paper considers a convex composite optimization problem with affine constraints, which includes problems that take the form of minimizing a smooth convex objective function over the intersection of (simple) convex sets, or regularized with multiple (simple) functions. Motivated by high-dimensional applications in which exact projection/proximal computations are not tractable, we propose a \te…

    Submitted 21 February, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

  9. arXiv:2010.14314  [pdf, ps, other]

    math.OC math.NA

    Faster Lagrangian-Based Methods in Convex Optimization

    Authors: Shoham Sabach, Marc Teboulle

    Abstract: In this paper, we aim at unifying, simplifying and improving the convergence rate analysis of Lagrangian-based methods for convex optimization problems. We first introduce the notion of nice primal algorithmic map, which plays a central role in the unification and in the simplification of the analysis of most Lagrangian-based methods. Equipped with a nice primal algorithmic map, we then introduce…

    Submitted 6 June, 2023; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Minor corrections

    MSC Class: 90C25; 65K05

    Journal ref: SIAM Journal on Optimization, Vol. 32, Iss. 1 (2022)

  10. Alternating Minimization Based First-Order Method for the Wireless Sensor Network Localization Problem

    Authors: Eyal Gur, Shoham Sabach, Shimrit Shtern

    Abstract: We propose an algorithm for the Wireless Sensor Network localization problem, which is based on the well-known algorithmic framework of Alternating Minimization. We start with a non-smooth and non-convex minimization, and transform it into an equivalent smooth and non-convex problem, which stands at the heart of our study. This paves the way to a new method which is globally convergent: not only d…

    Submitted 14 October, 2020; originally announced October 2020.

  11. arXiv:2010.04504  [pdf, other]

    math.OC

    Non-Convex Split Feasibility Problems: Models, Algorithms and Theory

    Authors: Aviv Gibali, Shoham Sabach, Sergey Voldman

    Abstract: In this paper, we propose a catalog of iterative methods for solving the Split Feasibility Problem in the non-convex setting. We study four different optimization formulations of the problem, where each model is advantageous in different settings of the problem. For each model, we study relevant iterative algorithms, some of which are well-known in this area and some are new. All the studied meth…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: To appear in Open Journal of Mathematical Optimization

  12. arXiv:1904.03537  [pdf, other]

    math.OC cs.CV cs.LG math.NA

    Convex-Concave Backtracking for Inertial Bregman Proximal Gradient Algorithms in Non-Convex Optimization

    Authors: Mahesh Chandra Mukkamala, Peter Ochs, Thomas Pock, Shoham Sabach

    Abstract: Backtracking line-search is an old yet powerful strategy for finding better step sizes to be used in proximal gradient algorithms. The main principle is to locally find a simple convex upper bound of the objective function, which in turn controls the step size that is used. In the case of inertial proximal gradient algorithms, the situation becomes much more difficult and usually leads to very restr…

    Submitted 5 November, 2019; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: 29 pages

    MSC Class: 90C25; 26B25; 49M27; 52A41; 65K05

  13. Optimization on Spheres: Models and Proximal Algorithms with Computational Performance Comparisons

    Authors: D. Russell Luke, Shoham Sabach, Marc Teboulle

    Abstract: We present a unified treatment of the abstract problem of finding the best approximation between a cone and spheres in the image of affine transformations. Prominent instances of this problem are phase retrieval and source localization. The common geometry binding these problems permits a generic application of algorithmic ideas and abstract convergence results for nonconvex optimization. We organ…

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: 23 pages, 11 benchmarking studies

    Journal ref: SIAM J. Mathematics of Data Science, 1(3), 408-445 (2019)

  14. arXiv:1802.05581  [pdf, other]

    cs.LG math.OC

    Improved Complexities of Conditional Gradient-Type Methods with Applications to Robust Matrix Recovery Problems

    Authors: Dan Garber, Shoham Sabach, Atara Kaplan

    Abstract: Motivated by robust matrix recovery problems such as Robust Principal Component Analysis, we consider a general optimization problem of minimizing a smooth and strongly convex loss function applied to the sum of two blocks of variables, where each block of variables is constrained or regularized individually. We study a Conditional Gradient-Type method which is able to leverage the special structu…

    Submitted 15 November, 2019; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Accepted to Mathematical Programming

  15. arXiv:1801.03013  [pdf, ps, other]

    math.OC

    Nonconvex Lagrangian-Based Optimization: Monitoring Schemes and Global Convergence

    Authors: Jérôme Bolte, Shoham Sabach, Marc Teboulle

    Abstract: We introduce a novel approach addressing global analysis of a difficult class of nonconvex-nonsmooth optimization problems within the important framework of Lagrangian-based methods. This genuinely nonlinear class captures many problems in modern disparate fields of applications. It features complex geometries, and qualification conditions and other regularity properties do not hold everywhere. To addr…

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: Accepted for publication in "Mathematics of Operations Research", August 27, 2017

    MSC Class: 90C30; 49M37; 65K10

  16. arXiv:1706.06461  [pdf, ps, other]

    math.OC math.NA

    First Order Methods beyond Convexity and Lipschitz Gradient Continuity with Applications to Quadratic Inverse Problems

    Authors: Jérôme Bolte, Shoham Sabach, Marc Teboulle, Yakov Vaisbourd

    Abstract: We focus on nonconvex and nonsmooth minimization problems with a composite objective, where the differentiable part of the objective is freed from the usual and restrictive global Lipschitz gradient continuity assumption. This longstanding smoothness restriction is pervasive in first order methods (FOM), and was recently circumvented for convex composite optimization by Bauschke, Bolte and Teboulle,…

    Submitted 20 June, 2017; originally announced June 2017.

  17. On Fienup Methods for Regularized Phase Retrieval

    Authors: Edouard Pauwels, Amir Beck, Yonina C. Eldar, Shoham Sabach

    Abstract: Alternating minimization, or Fienup methods, have a long history in phase retrieval. We provide new insights related to the empirical and theoretical analysis of these algorithms when used with Fourier measurements and combined with convex priors. In particular, we show that Fienup methods can be viewed as performing alternating minimization on a regularized nonconvex least-squares problem with re…

    Submitted 27 February, 2017; originally announced February 2017.

  18. arXiv:1702.03999  [pdf, ps, other]

    math.OC

    A First Order Method for Solving Convex Bi-Level Optimization Problems

    Authors: Shoham Sabach, Shimrit Shtern

    Abstract: In this paper we study convex bi-level optimization problems for which the inner level consists of minimization of the sum of smooth and nonsmooth functions. The outer level aims at minimizing a smooth and strongly convex function over the optimal solutions set of the inner problem. We analyze a first order method which is based on an existing fixed-point algorithm. Global sublinear rate of conver…

    Submitted 13 February, 2017; originally announced February 2017.

  19. Inertial Proximal Alternating Linearized Minimization (iPALM) for Nonconvex and Nonsmooth Problems

    Authors: Thomas Pock, Shoham Sabach

    Abstract: In this paper we study nonconvex and nonsmooth optimization problems with semi-algebraic data, where the variables vector is split into several blocks of variables. The problem consists of one smooth function of the entire variables vector and the sum of nonsmooth functions for each block separately. We analyze an inertial version of the Proximal Alternating Linearized Minimization (PALM) algorith…

    Submitted 8 February, 2017; originally announced February 2017.

  20. arXiv:1502.03716  [pdf, other]

    math.OC

    The Cyclic Block Conditional Gradient Method for Convex Optimization Problems

    Authors: Amir Beck, Edouard Pauwels, Shoham Sabach

    Abstract: In this paper we study the convex problem of optimizing the sum of a smooth function and a compactly supported non-smooth term with a specific separable form. We analyze the block version of the generalized conditional gradient method when the blocks are chosen in a cyclic order. A global sublinear rate of convergence is established for two different stepsize strategies commonly used in this class…

    Submitted 25 September, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

    Comments: 26 pages, 2 figures

  21. arXiv:1408.1887  [pdf, other]

    math.OC math.NA

    Proximal Heterogeneous Block Input-Output Method and application to Blind Ptychographic Diffraction Imaging

    Authors: Robert Hesse, D. Russell Luke, Shoham Sabach, Matthew K. Tam

    Abstract: We propose a general alternating minimization algorithm for nonconvex optimization problems with separable structure and nonconvex coupling between blocks of variables. To fix our ideas, we apply the methodology to the problem of blind ptychographic imaging. Compared to other schemes in the literature, our approach differs in two ways: (i) it is posed within a clear mathematical framework with pra…

    Submitted 8 August, 2014; originally announced August 2014.

    Comments: 32 pages, 3 tables, 5 figures

    MSC Class: 90C05; 90C25; 90C30; 90C52; 65K05

    Journal ref: SIAM J. on Imaging Sciences, 8(1):426-457 (2015)