Skip to main content

Showing 1–17 of 17 results for author: Hsieh, Y

Searching in archive math. Search in all archives.
.
  1. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  2. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-** Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  3. arXiv:2210.13867  [pdf, ps, other

    cs.LG math.PR math.ST

    A Dynamical System View of Langevin-Based Non-Convex Sampling

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Non-convex sampling is a key challenge in machine learning, central to non-convex optimization in deep learning as well as to approximate probabilistic inference. Despite its significance, theoretically there remain many important challenges: Existing guarantees (1) typically only hold for the averaged iterates rather than the more desirable last iterates, (2) lack convergence metrics that capture… ▽ More

    Submitted 13 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: typos corrected, references added

    MSC Class: 62D05

  4. arXiv:2207.07105  [pdf, ps, other

    stat.ML cs.LG math.OC

    Continuous-time Analysis for Variational Inequalities: An Overview and Desiderata

    Authors: Tatjana Chavdarova, Ya-** Hsieh, Michael I. Jordan

    Abstract: Algorithms that solve zero-sum games, multi-objective agent objectives, or, more generally, variational inequality (VI) problems are notoriously unstable on general problems. Owing to the increasing need for solving such problems in machine learning, this instability has been highlighted in recent years as a significant research challenge. In this paper, we provide an overview of recent progress i… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  5. arXiv:2206.06795  [pdf, other

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif… ▽ More

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48

  6. arXiv:2206.04113  [pdf, other

    math.OC cs.DC cs.LG cs.MA

    Push--Pull with Device Sampling

    Authors: Yu-Guan Hsieh, Yassine Laguel, Franck Iutzeler, Jérôme Malick

    Abstract: We consider decentralized optimization problems in which a number of agents collaborate to minimize the average of their local functions by exchanging over an underlying communication graph. Specifically, we place ourselves in an asynchronous model where only a random portion of nodes perform computation at each iteration, while the information exchange can be conducted between all the nodes and i… ▽ More

    Submitted 17 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: In IEEE Transactions on Automatic Control

  7. arXiv:2206.03922  [pdf, other

    cs.GT cs.LG math.OC

    A unified stochastic approximation framework for learning in games

    Authors: Panayotis Mertikopoulos, Ya-** Hsieh, Volkan Cevher

    Abstract: We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential/multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In a… ▽ More

    Submitted 3 July, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 40 pages, 5 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 68T02

  8. arXiv:2105.13348  [pdf, other

    math.OC cs.LG cs.MA

    Optimization in Open Networks via Dual Averaging

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In networks of autonomous agents (e.g., fleets of vehicles, scattered sensors), the problem of minimizing the sum of the agents' local functions has received a lot of interest. We tackle here this distributed optimization problem in the case of open networks when agents can join and leave the network at any time. Leveraging recent online optimization techniques, we propose and analyze the converge… ▽ More

    Submitted 16 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: In 60th IEEE Conference on Decision and Control (CDC 2021); 7 pages, 1 figure

  9. arXiv:2104.12761  [pdf, other

    cs.GT cs.LG math.OC

    Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

    Authors: Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

    Abstract: In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often measured by its regret. However, no-regret algorithms are not created equal in terms of game-theoretic guarantees: depending on how they are tuned, some of them may… ▽ More

    Submitted 16 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: In the 34th Annual Conference on Learning Theory (COLT 2021); 35 pages, 2 figures

  10. arXiv:2012.11579  [pdf, ps, other

    cs.LG cs.MA math.OC

    Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In this paper, we provide a general framework for studying multi-agent online learning problems in the presence of delays and asynchronicities. Specifically, we propose and analyze a class of adaptive dual averaging schemes in which agents only need to accumulate gradient feedback received from the whole system, without requiring any between-agent coordination. In the single-agent case, the adapti… ▽ More

    Submitted 16 April, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted by Journal of Machine Learning Research (JMLR)

  11. arXiv:2007.03795  [pdf, other

    cs.LG math.OC stat.ML

    Conditional gradient methods for stochastically constrained convex minimization

    Authors: Maria-Luiza Vladarean, Ahmet Alacaoglu, Ya-** Hsieh, Volkan Cevher

    Abstract: We propose two novel conditional gradient-based methods for solving structured stochastic convex optimization problems with a large number of linear constraints. Instances of this template naturally arise from SDP-relaxations of combinatorial problems, which involve a number of constraints that is polynomial in the problem dimension. The most important feature of our framework is that only a subse… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  12. arXiv:2006.09065  [pdf, other

    math.OC cs.LG stat.ML

    The limits of min-max optimization algorithms: convergence to spurious non-critical sets

    Authors: Ya-** Hsieh, Panayotis Mertikopoulos, Volkan Cevher

    Abstract: Compared to ordinary function minimization problems, min-max optimization algorithms encounter far greater challenges because of the existence of periodic cycles and similar phenomena. Even though some of these behaviors can be overcome in the convex-concave regime, the general case is considerably more difficult. On that account, we take an in-depth look at a comprehensive class of state-of-the a… ▽ More

    Submitted 14 February, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  13. arXiv:2003.10162  [pdf, other

    math.OC cs.GT cs.LG

    Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Owing to their stability and convergence speed, extragradient methods have become a staple for solving large-scale saddle-point problems in machine learning. The basic premise of these algorithms is the use of an extrapolation step before performing an update; thanks to this exploration step, extra-gradient methods overcome many of the non-convergence issues that plague gradient descent/ascent sch… ▽ More

    Submitted 5 November, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: In Advances in Neural Information Processing Systems 33 (NeurIPS 2020); 29 pages, 5 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  14. arXiv:1908.08465  [pdf, other

    math.OC cs.GT cs.LG

    On the convergence of single-call stochastic extra-gradient methods

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Variational inequalities have recently attracted considerable interest in machine learning as a flexible paradigm for models that go beyond ordinary loss function minimization (such as generative adversarial networks and related deep learning systems). In this setting, the optimal $\mathcal{O}(1/t)$ convergence rate for solving smooth monotone variational inequalities is achieved by the Extra-Grad… ▽ More

    Submitted 11 February, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: In Advances in Neural Information Processing Systems 32 (NeurIPS 2019); 24 pages, 3 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  15. arXiv:1802.10174  [pdf, other

    cs.LG math.OC

    Mirrored Langevin Dynamics

    Authors: Ya-** Hsieh, Ali Kavis, Paul Rolland, Volkan Cevher

    Abstract: We consider the problem of sampling from constrained distributions, which has posed significant challenges to both non-asymptotic analysis and algorithmic design. We propose a unified framework, which is inspired by the classical mirror descent, to derive novel first-order sampling schemes. We prove that, for a general target distribution with strongly convex potential, our framework implies the e… ▽ More

    Submitted 30 December, 2020; v1 submitted 27 February, 2018; originally announced February 2018.

  16. arXiv:1602.00724  [pdf, other

    math.OC stat.AP

    Frank-Wolfe Works for Non-Lipschitz Continuous Gradient Objectives: Scalable Poisson Phase Retrieval

    Authors: Gergely Odor, Yen-Huan Li, Alp Yurtsever, Ya-** Hsieh, Quoc Tran-Dinh, Marwa El Halabi, Volkan Cevher

    Abstract: We study a phase retrieval problem in the Poisson noise model. Motivated by the PhaseLift approach, we approximate the maximum-likelihood estimator by solving a convex program with a nuclear norm constraint. While the Frank-Wolfe algorithm, together with the Lanczos method, can efficiently deal with nuclear norm constraints, our objective function does not have a Lipschitz continuous gradient, and… ▽ More

    Submitted 1 February, 2016; originally announced February 2016.

  17. arXiv:1506.08163  [pdf, ps, other

    math.ST

    A Geometric View on Constrained M-Estimators

    Authors: Yen-Huan Li, Ya-** Hsieh, Nissim Zerbib, Volkan Cevher

    Abstract: We study the estimation error of constrained M-estimators, and derive explicit upper bounds on the expected estimation error determined by the Gaussian width of the constraint set. Both of the cases where the true parameter is on the boundary of the constraint set (matched constraint), and where the true parameter is strictly in the constraint set (mismatched constraint) are considered. For both c… ▽ More

    Submitted 26 June, 2015; originally announced June 2015.