Showing 1–9 of 9 results for author: Hadiji, H

Searching in archive cs.
  1. arXiv:2406.14059  [pdf, other]

    cs.GT cs.LG math.OC stat.ML

    Tracking solutions of time-varying variational inequalities

    Authors: Hédi Hadiji, Sarah Sachs, Cristóbal Guzmán

    Abstract: Tracking the solution of time-varying variational inequalities is an important problem with applications in game theory, optimization, and machine learning. Existing work considers time-varying games or time-varying optimization problems. For strongly convex optimization problems or strongly monotone games, these results provide tracking guarantees under the assumption that the variation of the ti…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2304.12768  [pdf, ps, other]

    cs.GT math.OC stat.ML

    Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games

    Authors: Hédi Hadiji, Sarah Sachs, Tim van Erven, Wouter M. Koolen

    Abstract: In the first-order query model for zero-sum $K\times K$ matrix games, players observe the expected pay-offs for all their possible actions under the randomized action played by their opponent. This classical model has received renewed interest after the discovery by Rakhlin and Sridharan that $ε$-approximate Nash equilibria can be computed efficiently from $O(\frac{\ln K}{ε})$ instead of…

    Submitted 2 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  3. arXiv:2303.03272  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Extended version of 'Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness' by the same authors. arXiv admin note: text overlap with arXiv:2202.07554

  4. arXiv:2202.07554  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 8 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  5. arXiv:2202.05630  [pdf, ps, other]

    cs.LG

    Scale-free Unconstrained Online Learning for Curved Losses

    Authors: Jack J. Mayo, Hédi Hadiji, Tim van Erven

    Abstract: A sequence of works in unconstrained online convex optimisation have investigated the possibility of adapting simultaneously to the norm $U$ of the comparator and the maximum norm $G$ of the gradients. In full generality, matching upper and lower bounds are known which show that this comes at the unavoidable cost of an additive $G U^3$, which is not needed when either $G$ or $U$ is known in advanc…

    Submitted 15 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: 34 pages

  6. arXiv:2102.07521  [pdf, ps, other]

    cs.LG stat.ML

    Distributed Online Learning for Joint Regret with Communication Constraints

    Authors: Dirk van der Hoeven, Hédi Hadiji, Tim van Erven

    Abstract: We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a prediction and provides a corresponding gradient, and then the agents are allowed to send a $b$-bit message to their neighbors in the graph. All agents cooperate to co…

    Submitted 25 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  7. arXiv:2010.01874  [pdf, ps, other]

    stat.ML cs.LG

    Diversity-Preserving K-Armed Bandits, Revisited

    Authors: Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz

    Abstract: We consider the bandit-based framework for diversity-preserving recommendations introduced by Celis et al. (2019), who approached it in the case of a polytope mainly by a reduction to the setting of linear bandits. We design a UCB algorithm using the specific structure of the setting and show that it enjoys a bounded distribution-dependent regret in the natural cases when the optimal mixed actions…

    Submitted 15 April, 2024; v1 submitted 5 October, 2020; originally announced October 2020.

  8. arXiv:1905.10221  [pdf, other]

    stat.ML cs.LG math.ST

    Polynomial Cost of Adaptation for X-Armed Bandits

    Authors: Hédi Hadiji

    Abstract: In the context of stochastic continuum-armed bandits, we present an algorithm that adapts to the unknown smoothness of the objective function. We exhibit and compute a polynomial cost of adaptation to the Hölder regularity for regret minimization. To do this, we first reconsider the recent lower bound of Locatelli and Carpentier [20], and define and characterize admissible rate functions. Our ne…

    Submitted 9 December, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: Thirty-third Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada

  9. arXiv:1805.05071  [pdf, other]

    stat.ML cs.LG math.ST

    KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints

    Authors: Aurélien Garivier, Hédi Hadiji, Pierre Menard, Gilles Stoltz

    Abstract: We consider $K$-armed stochastic bandits and consider cumulative regret bounds up to time $T$. We are interested in strategies achieving simultaneously a distribution-free regret bound of optimal order $\sqrt{KT}$ and a distribution-dependent regret that is asymptotically optimal, that is, matching the $κ\ln T$ lower bound by Lai and Robbins (1985) and Burnetas and Katehakis (1996), where $κ$ is t…

    Submitted 1 July, 2022; v1 submitted 14 May, 2018; originally announced May 2018.