Skip to main content

Showing 1–3 of 3 results for author: Arslantas, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.08906  [pdf, other

    cs.GT cs.AI math.OC

    Strategizing against Q-learners: A Control-theoretical Approach

    Authors: Yuksel Arslantas, Ege Yuceel, Muhammed O. Sayin

    Abstract: In this paper, we explore the susceptibility of the independent Q-learning algorithms (a classical and widely used multi-agent reinforcement learning method) to strategic manipulation of sophisticated opponents in normal-form games played repeatedly. We quantify how much strategically sophisticated agents can exploit naive Q-learners if they know the opponents' Q-learning algorithm. To this end, w… ▽ More

    Submitted 25 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  2. arXiv:2402.02147  [pdf, other

    cs.GT

    Team Collaboration vs Competition: New Fictitious Play Dynamics for Multi-team Zero-Sum Games

    Authors: Ahmed Said Donmez, Yuksel Arslantas, Muhammed O. Sayin

    Abstract: This paper presents a new variant of fictitious play (FP) called team-fictitious-play (Team-FP) that can reach equilibrium in multi-team competition, different from the other variants of FP. We specifically focus on zero-sum potential team games with network separable interactions (ZSPTGs), unifying potential games (if there is a single team) and zero-sum polymatrix games (if each team has a singl… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  3. arXiv:2311.00778  [pdf, other

    cs.GT

    Convergence of Heterogeneous Learning Dynamics in Zero-sum Stochastic Games

    Authors: Yuksel Arslantas, Ege Yuceel, Yigit Yalin, Muhammed O. Sayin

    Abstract: This paper presents new families of algorithms for the repeated play of two-agent (near) zero-sum games and two-agent zero-sum stochastic games. For example, the family includes fictitious play and its variants as members. Commonly, the algorithms in this family are all uncoupled, rational, and convergent even in heterogeneous cases, e.g., where the dynamics may differ in terms of learning rates,… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.