Skip to main content

Showing 1–13 of 13 results for author: Ochs, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18222  [pdf, other

    cs.LG math.OC

    From Learning to Optimize to Learning Optimization Algorithms

    Authors: Camille Castera, Peter Ochs

    Abstract: Towards designing learned optimization algorithms that are usable beyond their training setting, we identify key principles that classical algorithms obey, but have up to now, not been used for Learning to Optimize (L2O). Following these principles, we provide a general design pipeline, taking into account data, architecture and learning strategy, and thereby enabling a synergy between classical o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.03290  [pdf, other

    cs.LG math.OC

    Learning-to-Optimize with PAC-Bayesian Guarantees: Theoretical Considerations and Practical Implementation

    Authors: Michael Sucker, Jalal Fadili, Peter Ochs

    Abstract: We use the PAC-Bayesian theory for the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-Bayesian bounds) and explicit trade-off between convergence guarantees and convergence speed, which contrasts with the typical worst-case analysis. Our learned optimization algorithms prova… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2311.10053  [pdf, other

    math.OC cs.LG math.DS

    Near-optimal Closed-loop Method via Lyapunov Dam** for Convex Optimization

    Authors: Severin Maier, Camille Castera, Peter Ochs

    Abstract: We introduce an autonomous system with closed-loop dam** for first-order convex optimization. While, to this day, optimal rates of convergence are almost exclusively achieved by non-autonomous methods via open-loop dam** (e.g., Nesterov's algorithm), we show that our system, featuring a closed-loop dam**, exhibits a rate arbitrarily close to the optimal one. We do so by coupling the dam**… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2210.11113  [pdf, other

    cs.LG stat.ML

    PAC-Bayesian Learning of Optimization Algorithms

    Authors: Michael Sucker, Peter Ochs

    Abstract: We apply the PAC-Bayes theory to the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-bounds) and explicit trade-off between a high probability of convergence and a high convergence speed. Even in the limit case, where convergence is guaranteed, our learned optimization algori… ▽ More

    Submitted 15 February, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to AISTATS 2023

  5. arXiv:2208.03107  [pdf, other

    math.OC cs.LG

    Fixed-Point Automatic Differentiation of Forward--Backward Splitting Algorithms for Partly Smooth Functions

    Authors: Sheheryar Mehmood, Peter Ochs

    Abstract: A large class of non-smooth practical optimization problems can be written as minimization of a sum of smooth and partly smooth functions. We consider such structured problems which also depend on a parameter vector and study the problem of differentiating its solution map** with respect to the parameter which has far reaching applications in sensitivity analysis and parameter learning optmizati… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  6. arXiv:2012.13161  [pdf, other

    math.OC cs.CV cs.LG math.NA

    Global Convergence of Model Function Based Bregman Proximal Minimization Algorithms

    Authors: Mahesh Chandra Mukkamala, Jalal Fadili, Peter Ochs

    Abstract: Lipschitz continuity of the gradient map** of a continuously differentiable function plays a crucial role in designing various optimization algorithms. However, many functions arising in practical applications such as low rank matrix factorization or deep neural network problems do not have a Lipschitz continuous gradient. This led to the development of a generalized notion known as the $L$-smad… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 44 pages, 22 figures

    MSC Class: 90C25; 26B25; 49M27; 52A41; 65K05

  7. arXiv:2008.07872  [pdf, other

    cs.CV

    Self-supervised Sparse to Dense Motion Segmentation

    Authors: Amirhossein Kardoost, Kalun Ho, Peter Ochs, Margret Keuper

    Abstract: Observable motion in videos can give rise to the definition of objects moving with respect to the scene. The task of segmenting such moving objects is referred to as motion segmentation and is usually tackled either by aggregating motion information in long, sparse point trajectories, or by directly producing per frame dense segmentations relying on large amounts of training data. In this paper, w… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  8. arXiv:1910.03638  [pdf, other

    math.OC cs.CV cs.IR cs.LG

    Bregman Proximal Framework for Deep Linear Neural Networks

    Authors: Mahesh Chandra Mukkamala, Felix Westerkamp, Emanuel Laude, Daniel Cremers, Peter Ochs

    Abstract: A typical assumption for the analysis of first order optimization methods is the Lipschitz continuity of the gradient of the objective function. However, for many practical applications this assumption is violated, including loss functions in deep learning. To overcome this issue, certain extensions based on generalized proximity measures known as Bregman distances were introduced. This initiated… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: 34 pages, 54 images

    MSC Class: 90C26; 26B25; 90C30; 49M27; 47J25; 65K05; 65F22

  9. arXiv:1905.09050  [pdf, other

    math.OC cs.CV cs.IR stat.ML

    Beyond Alternating Updates for Matrix Factorization with Inertial Bregman Proximal Gradient Algorithms

    Authors: Mahesh Chandra Mukkamala, Peter Ochs

    Abstract: Matrix Factorization is a popular non-convex optimization problem, for which alternating minimization schemes are mostly used. They usually suffer from the major drawback that the solution is biased towards one of the optimization variables. A remedy is non-alternating schemes. However, due to a lack of Lipschitz continuity of the gradient in matrix factorization problems, convergence cannot be gu… ▽ More

    Submitted 6 December, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: Accepted at NeuRIPS 2019. Paper url: http://papers.nips.cc/paper/8679-beyond-alternating-updates-for-matrix-factorization-with-inertial-bregman-proximal-gradient-algorithms

  10. arXiv:1904.03537  [pdf, other

    math.OC cs.CV cs.LG math.NA

    Convex-Concave Backtracking for Inertial Bregman Proximal Gradient Algorithms in Non-Convex Optimization

    Authors: Mahesh Chandra Mukkamala, Peter Ochs, Thomas Pock, Shoham Sabach

    Abstract: Backtracking line-search is an old yet powerful strategy for finding a better step sizes to be used in proximal gradient algorithms. The main principle is to locally find a simple convex upper bound of the objective function, which in turn controls the step size that is used. In case of inertial proximal gradient algorithms, the situation becomes much more difficult and usually leads to very restr… ▽ More

    Submitted 5 November, 2019; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: 29 pages

    MSC Class: 90C25; 26B25; 49M27; 52A41; 65K05

  11. arXiv:1901.08087  [pdf, other

    math.OC cs.LG

    Model Function Based Conditional Gradient Method with Armijo-like Line Search

    Authors: Yura Malitsky, Peter Ochs

    Abstract: The Conditional Gradient Method is generalized to a class of non-smooth non-convex optimization problems with many applications in machine learning. The proposed algorithm iterates by minimizing so-called model functions over the constraint set. Complemented with an Amijo line search procedure, we prove that subsequences converge to a stationary point. The abstract framework of model functions pro… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

  12. arXiv:1803.08660  [pdf, other

    cs.CV cs.NE

    Lifting Layers: Analysis and Applications

    Authors: Peter Ochs, Tim Meinhardt, Laura Leal-Taixe, Michael Moeller

    Abstract: The great advances of learning-based approaches in image processing and computer vision are largely based on deeply nested networks that compose linear transfer functions with suitable non-linearities. Interestingly, the most frequently used non-linearities in imaging applications (variants of the rectified linear unit) are uncommon in low dimensional approximation problems. In this paper we propo… ▽ More

    Submitted 23 March, 2018; originally announced March 2018.

  13. arXiv:1404.4805  [pdf, other

    cs.CV math.OC

    iPiano: Inertial Proximal Algorithm for Non-Convex Optimization

    Authors: Peter Ochs, Yun** Chen, Thomas Brox, Thomas Pock

    Abstract: In this paper we study an algorithm for solving a minimization problem composed of a differentiable (possibly non-convex) and a convex (possibly non-differentiable) function. The algorithm iPiano combines forward-backward splitting with an inertial force. It can be seen as a non-smooth split version of the Heavy-ball method from Polyak. A rigorous analysis of the algorithm for the proposed class o… ▽ More

    Submitted 18 April, 2014; originally announced April 2014.

    Comments: 32pages, 7 figures, to appear in SIAM Journal on Imaging Sciences