Skip to main content

Showing 1–5 of 5 results for author: Poupart, P

Searching in archive math. Search in all archives.
.
  1. arXiv:2403.11062  [pdf, other

    cs.LG math.OC

    A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

    Authors: Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart

    Abstract: Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their practical applications. This inefficiency stems from two main facts: a focus on tail-end performance that overlooks many sampled trajectories, and the potential of gradient vanishing when the lower tail of the return di… ▽ More

    Submitted 28 June, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: RLC 2024

  2. arXiv:2006.14592  [pdf, other

    cs.LG math.OC stat.ML

    Newton-type Methods for Minimax Optimization

    Authors: Guojun Zhang, Kaiwen Wu, Pascal Poupart, Yaoliang Yu

    Abstract: Differential games, in particular two-player sequential zero-sum games (a.k.a. minimax optimization), have been an important modeling tool in applied science and received renewed interest in machine learning due to many recent applications, such as adversarial training, generative models and reinforcement learning. However, existing theory mostly focuses on convex-concave functions with few except… ▽ More

    Submitted 18 February, 2023; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: code update

  3. arXiv:2003.03731  [pdf, ps, other

    math.OC cs.LG math.AG

    A Positivstellensatz for Conditional SAGE Signomials

    Authors: Allen Houze Wang, Priyank Jaini, Yaoliang Yu, Pascal Poupart

    Abstract: Recently, the conditional SAGE certificate has been proposed as a sufficient condition for signomial positivity over a convex set. In this article, we show that the conditional SAGE certificate is $\textit{complete}$. That is, for any signomial $f(\mathbf{x}) = \sum_{j=1}^{\ell}c_j \exp(\mathbf{A}_j\mathbf{x})$ defined by rational exponents that is positive over a compact convex set $\mathcal{X}$,… ▽ More

    Submitted 24 October, 2020; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: 19 pages, preprint

  4. arXiv:2002.11875  [pdf, other

    cs.LG math.OC stat.ML

    Optimality and Stability in Non-Convex Smooth Games

    Authors: Guojun Zhang, Pascal Poupart, Yaoliang Yu

    Abstract: Convergence to a saddle point for convex-concave functions has been studied for decades, while recent years has seen a surge of interest in non-convex (zero-sum) smooth games, motivated by their recent wide applications. It remains an intriguing research challenge how local optimal points are defined and which algorithm can converge to such points. An interesting concept is known as the local mini… ▽ More

    Submitted 3 February, 2022; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: accepted by JMLR 2022

  5. arXiv:1907.03783  [pdf, other

    cs.LG math.ST stat.ML

    Comparing EM with GD in Mixture Models of Two Components

    Authors: Guojun Zhang, Pascal Poupart, George Trimponias

    Abstract: The expectation-maximization (EM) algorithm has been widely used in minimizing the negative log likelihood (also known as cross entropy) of mixture models. However, little is understood about the goodness of the fixed points it converges to. In this paper, we study the regions where one component is missing in two-component mixture models, which we call one-cluster regions. We analyze the propensi… ▽ More

    Submitted 29 October, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: UAI 2019