Skip to main content

Showing 1–7 of 7 results for author: Ghai, U

Searching in archive math. Search in all archives.
.
  1. arXiv:2305.17552  [pdf, other

    cs.LG math.OC

    Online Nonstochastic Model-Free Reinforcement Learning

    Authors: Udaya Ghai, Arushi Gupta, Wenhan Xia, Karan Singh, Elad Hazan

    Abstract: We investigate robust model-free reinforcement learning algorithms designed for environments that may be dynamic or even adversarial. Traditional state-based policies often struggle to accommodate the challenges imposed by the presence of unmodeled disturbances in such settings. Moreover, optimizing linear state-based policies pose an obstacle for efficient optimization, leading to nonconvex objec… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Camera-ready version for NeurIPS 2023

  2. arXiv:2205.15235  [pdf, other

    cs.LG math.OC

    Non-convex online learning via algorithmic equivalence

    Authors: Udaya Ghai, Zhou Lu, Elad Hazan

    Abstract: We study an algorithmic equivalence technique between non-convex gradient descent and convex mirror descent. We start by looking at a harder problem of regret minimization in online non-convex optimization. We show that under certain geometric and smoothness conditions, online gradient descent applied to non-convex functions is an approximation of online mirror descent applied to convex functions… ▽ More

    Submitted 12 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

  3. arXiv:2201.13288  [pdf, other

    math.OC cs.LG stat.ML

    A Regret Minimization Approach to Multi-Agent Control

    Authors: Udaya Ghai, Udari Madhushani, Naomi Leonard, Elad Hazan

    Abstract: We study the problem of multi-agent control of a dynamical system with known dynamics and adversarial disturbances. Our study focuses on optimal control without centralized precomputed policies, but rather with adaptive control policies for the different agents that are only equipped with a stabilizing controller. We give a reduction from any (standard) regret minimizing control method to a distri… ▽ More

    Submitted 25 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:7422-7434, 2022

  4. arXiv:2107.07732  [pdf, ps, other

    math.OC cs.LG

    Robust Online Control with Model Misspecification

    Authors: Xinyi Chen, Udaya Ghai, Elad Hazan, Alexandre Megretski

    Abstract: We study online control of an unknown nonlinear dynamical system that is approximated by a time-invariant linear system with model misspecification. Our study focuses on robustness, a measure of how much deviation from the assumed linear approximation can be tolerated by a controller while maintaining finite $\ell_2$-gain. A basic methodology to analyze robustness is via the small gain theorem.… ▽ More

    Submitted 4 April, 2022; v1 submitted 16 July, 2021; originally announced July 2021.

  5. arXiv:2012.06695  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Generating Adversarial Disturbances for Controller Verification

    Authors: Udaya Ghai, David Snyder, Anirudha Majumdar, Elad Hazan

    Abstract: We consider the problem of generating maximally adversarial disturbances for a given controller assuming only blackbox access to it. We propose an online learning approach to this problem that \emph{adaptively} generates disturbances based on control inputs chosen by the controller. The goal of the disturbance generator is to minimize \emph{regret} versus a benchmark disturbance-generating policy… ▽ More

    Submitted 31 January, 2022; v1 submitted 11 December, 2020; originally announced December 2020.

  6. arXiv:2002.02064  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    No-Regret Prediction in Marginally Stable Systems

    Authors: Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

    Abstract: We consider the problem of online prediction in a marginally stable linear dynamical system subject to bounded adversarial or (non-isotropic) stochastic perturbations. This poses two challenges. Firstly, the system is in general unidentifiable, so recent and classical results on parameter recovery do not apply. Secondly, because we allow the system to be marginally stable, the state can grow polyn… ▽ More

    Submitted 23 June, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Comments: 43 pages. Appears in COLT 2020

  7. arXiv:1902.01903  [pdf, other

    cs.LG math.OC stat.ML

    Exponentiated Gradient Meets Gradient Descent

    Authors: Udaya Ghai, Elad Hazan, Yoram Singer

    Abstract: The (stochastic) gradient descent and the multiplicative update method are probably the most popular algorithms in machine learning. We introduce and study a new regularization which provides a unification of the additive and multiplicative updates. This regularization is derived from an hyperbolic analogue of the entropy function, which we call hypentropy. It is motivated by a natural extension o… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.