Skip to main content

Showing 1–15 of 15 results for author: Muehlebach, M

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.18100  [pdf, other

    cs.LG math.OC

    A Pontryagin Perspective on Reinforcement Learning

    Authors: Onno Eberhard, Claire Vernade, Michael Muehlebach

    Abstract: Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed action sequence is learned instead. We present three new algorithms: one robust model-based method and two sample-efficient model-free methods. Rather than basing o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2405.10618  [pdf, other

    cs.LG math.OC stat.ML

    Distributed Event-Based Learning via ADMM

    Authors: Guner Dilsad Er, Sebastian Trimpe, Michael Muehlebach

    Abstract: We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We can therefore guarantee convergence even if the loca… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 29 pages, 12 figures

  3. arXiv:2404.04355  [pdf, other

    math.OC eess.SY

    Gray-Box Nonlinear Feedback Optimization

    Authors: Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

    Abstract: Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity of the system to construct gradients, whereas model-free approaches bypass this need by estimating gradients from real-time evaluations of the… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2403.12859  [pdf, other

    math.OC cs.LG stat.ML

    Primal Methods for Variational Inequality Problems with Functional Constraints

    Authors: Liang Zhang, Niao He, Michael Muehlebach

    Abstract: Constrained variational inequality problems are recognized for their broad applications across various fields including machine learning and operations research. First-order methods have emerged as the standard approach for solving these problems due to their simplicity and scalability. However, they typically rely on projection or linear minimization oracles to navigate the feasible set, which be… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  5. arXiv:2401.14029  [pdf, other

    math.OC cs.LG eess.SY

    Towards a Systems Theory of Algorithms

    Authors: Florian Dörfler, Zhiyu He, Giuseppe Belgioioso, Saverio Bolognani, John Lygeros, Michael Muehlebach

    Abstract: Traditionally, numerical algorithms are seen as isolated pieces of code confined to an {\em in silico} existence. However, this perspective is not appropriate for many modern computational approaches in control, learning, or optimization, wherein {\em in vivo} algorithms interact with their environment. Examples of such {\em open algorithms} include various real-time optimization-based control str… ▽ More

    Submitted 30 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  6. arXiv:2306.03655  [pdf, other

    cs.LG math.OC

    Online Learning under Adversarial Nonlinear Constraints

    Authors: Pavel Kolev, Georg Martius, Michael Muehlebach

    Abstract: In many applications, learning systems are required to process continuous non-stationary data streams. We study this problem in an online learning framework and propose an algorithm that can deal with adversarial time-varying and nonlinear constraints. As we show in our work, the algorithm called Constraint Violation Velocity Projection (CVV-Pro) achieves $\sqrt{T}$ regret and converges to the fea… ▽ More

    Submitted 13 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  7. arXiv:2305.08536  [pdf, other

    math.OC

    A Dynamical Systems Perspective on Discrete Optimization

    Authors: Tong Guanchun, Michael Muehlebach

    Abstract: We discuss a dynamical systems perspective on discrete optimization. Departing from the fact that many combinatorial optimization problems can be reformulated as finding low energy spin configurations in corresponding Ising models, we derive a penalized rank-two relaxation of the Ising formulation. It turns out that the associated gradient flow dynamics exactly correspond to a type of hardware sol… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  8. arXiv:2303.09261  [pdf, other

    math.OC stat.ML

    Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold

    Authors: Sholom Schechtman, Daniil Tiapkin, Michael Muehlebach, Eric Moulines

    Abstract: We consider the problem of minimizing a non-convex function over a smooth manifold $\mathcal{M}$. We propose a novel algorithm, the Orthogonal Directions Constrained Gradient Method (ODCGM) which only requires computing a projection onto a vector space. ODCGM is infeasible but the iterates are constantly pulled towards the manifold, ensuring the convergence of ODCGM towards $\mathcal{M}$. ODCGM is… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  9. arXiv:2302.00316  [pdf, other

    math.OC cs.LG eess.SP stat.ML

    Accelerated First-Order Optimization under Nonlinear Constraints

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected gradients, these algorithms avoid optimization over the entire feasible set at each iteration. We prove convergence to stationary points even in a nonconvex setting… ▽ More

    Submitted 2 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 44 pages, 6 figures

  10. arXiv:2206.02953  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

    Authors: Aniket Das, Bernhard Schölkopf, Michael Muehlebach

    Abstract: We analyze the convergence rates of stochastic gradient algorithms for smooth finite-sum minimax optimization and show that, for many such algorithms, sampling the data points without replacement leads to faster convergence compared to sampling with replacement. For the smooth and strongly convex-strongly concave setting, we consider gradient descent ascent and the proximal point method, and prese… ▽ More

    Submitted 10 October, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  11. arXiv:2107.08225  [pdf, other

    math.OC cs.LG eess.SY

    On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We introduce a class of first-order methods for smooth constrained optimization that are based on an analogy to non-smooth dynamical systems. Two distinctive features of our approach are that (i) projections or optimizations over the entire feasible set are avoided, in stark contrast to projected gradient methods or the Frank-Wolfe method, and (ii) iterates are allowed to become infeasible, which… ▽ More

    Submitted 5 November, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: 47 pages, 11 figures

  12. arXiv:2002.12493  [pdf, other

    math.OC math.NA stat.ML

    Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic Perspectives

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We analyze the convergence rate of various momentum-based optimization algorithms from a dynamical systems point of view. Our analysis exploits fundamental topological properties, such as the continuous dependence of iterates on their initial conditions, to provide a simple characterization of convergence rates. In many cases, closed-form expressions are obtained that relate algorithm parameters t… ▽ More

    Submitted 12 April, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: 30 pages; 20 pages appendix and references

  13. arXiv:2002.03546  [pdf, ps, other

    math.OC eess.SY

    Continuous-time Lower Bounds for Gradient-based Algorithms

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: This article derives lower bounds on the convergence rate of continuous-time gradient-based optimization algorithms. The algorithms are subjected to a time-normalization constraint that avoids a reparametrization of time in order to make the discussion of continuous-time convergence rates meaningful. We reduce the multi-dimensional problem to a single dimension, recover well-known lower bounds fro… ▽ More

    Submitted 3 August, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 13 pages

  14. arXiv:1905.07436  [pdf, other

    math.OC cs.LG eess.SY stat.ML

    A Dynamical Systems Perspective on Nesterov Acceleration

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We present a dynamical system framework for understanding Nesterov's accelerated gradient method. In contrast to earlier work, our derivation does not rely on a vanishing step size argument. We show that Nesterov acceleration arises from discretizing an ordinary differential equation with a semi-implicit Euler integration scheme. We analyze both the underlying differential equation as well as the… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 11 pages, 4 figures, to appear in the Proceedings of the 36th International Conference on Machine Learning

  15. arXiv:1608.08823  [pdf, other

    math.OC eess.SY

    Approximation of Continuous-Time Infinite-Horizon Optimal Control Problems Arising in Model Predictive Control - Supplementary Notes

    Authors: Michael Muehlebach, Raffaello D'Andrea

    Abstract: These notes present preliminary results regarding two different approximations of linear infinite-horizon optimal control problems arising in model predictive control. Input and state trajectories are parametrized with basis functions and a finite dimensional representation of the dynamics is obtained via a Galerkin approach. It is shown that the two approximations provide lower, respectively uppe… ▽ More

    Submitted 31 August, 2016; originally announced August 2016.

    Comments: Supplementary notes, 10 pages