Skip to main content

Showing 1–1 of 1 results for author: Sullins, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:1910.03094  [pdf, other

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Combining No-regret and Q-learning

    Authors: Ian A. Kash, Michael Sullins, Katja Hofmann

    Abstract: Counterfactual Regret Minimization (CFR) has found success in settings like poker which have both terminal states and perfect recall. We seek to understand how to relax these requirements. As a first step, we introduce a simple algorithm, local no-regret learning (LONR), which uses a Q-learning-like update rule to allow learning without terminal states or perfect recall. We prove its convergence f… ▽ More

    Submitted 13 January, 2022; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Presented as conference paper at AAMAS 2020