Skip to main content

Showing 1–4 of 4 results for author: Karimi, M R

Searching in archive math. Search in all archives.
.
  1. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  2. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-** Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  3. arXiv:2210.13867  [pdf, ps, other

    cs.LG math.PR math.ST

    A Dynamical System View of Langevin-Based Non-Convex Sampling

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Non-convex sampling is a key challenge in machine learning, central to non-convex optimization in deep learning as well as to approximate probabilistic inference. Despite its significance, theoretically there remain many important challenges: Existing guarantees (1) typically only hold for the averaged iterates rather than the more desirable last iterates, (2) lack convergence metrics that capture… ▽ More

    Submitted 13 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: typos corrected, references added

    MSC Class: 62D05

  4. arXiv:2206.06795  [pdf, other

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif… ▽ More

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48