Skip to main content

Showing 1–4 of 4 results for author: Erez, L

.
  1. arXiv:2406.12406  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Fast Rates for Bandit PAC Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel learning algorithm for the agnostic $(\varepsilon,δ)$-PAC version of the problem, with sample complexity of… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2405.10027  [pdf, ps, other

    cs.LG cs.AI stat.ML

    The Real Price of Bandit Information in Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted label is correct or not. Our primary inquiry is with regard to the dependency on the number of labels $K$, and whether $T$-step regret bounds in this setting can be… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2207.14211  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Regret Minimization and Convergence to Equilibria in General-sum Markov Games

    Authors: Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour

    Abstract: An abundance of recent impossibility results establish that regret minimization in Markov games with adversarial opponents is both statistically and computationally intractable. Nevertheless, none of these results preclude the possibility of regret minimization under the assumption that all parties adopt the same learning procedure. In this work, we present the first (to our knowledge) algorithm f… ▽ More

    Submitted 8 August, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

  4. arXiv:2107.09572  [pdf, other

    cs.LG

    Best-of-All-Worlds Bounds for Online Learning with Feedback Graphs

    Authors: Liad Erez, Tomer Koren

    Abstract: We study the online learning with feedback graphs framework introduced by Mannor and Shamir (2011), in which the feedback received by the online learner is specified by a graph $G$ over the available actions. We develop an algorithm that simultaneously achieves regret bounds of the form: $\smash{\mathcal{O}(\sqrt{θ(G) T})}$ with adversarial losses; $\mathcal{O}(θ(G)\operatorname{polylog}{T})$ with… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.