Showing 1–7 of 7 results for author: Arslan, G

Searching in archive cs.
  1. arXiv:2403.18086  [pdf, ps, other]

    cs.GT econ.TH

    Generalizing Better Response Paths and Weakly Acyclic Games

    Authors: Bora Yongacoglu, Gürdal Arslan, Lacra Pavel, Serdar Yüksel

    Abstract: Weakly acyclic games generalize potential games and are fundamental to the study of game theoretic control. In this paper, we present a generalization of weakly acyclic games, and we observe its importance in multi-agent learning when agents employ experimental strategy updates in periods where they fail to best respond. While weak acyclicity is defined in terms of path connectivity properties of…

    Submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2403.18079  [pdf, ps, other]

    cs.GT cs.AI cs.LG

    Paths to Equilibrium in Normal-Form Games

    Authors: Bora Yongacoglu, Gürdal Arslan, Lacra Pavel, Serdar Yüksel

    Abstract: In multi-agent reinforcement learning (MARL), agents repeatedly interact across time and revise their strategies as new data arrives, producing a sequence of strategy profiles. This paper studies sequences of strategies satisfying a pairwise constraint inspired by policy updating in reinforcement learning, where an agent who is best responding in period $t$ does not switch its strategy in the next…

    Submitted 26 March, 2024; originally announced March 2024.

  3. arXiv:2308.03239  [pdf, other]

    cs.GT cs.LG cs.MA

    Asynchronous Decentralized Q-Learning: Two Timescale Analysis By Persistence

    Authors: Bora Yongacoglu, Gürdal Arslan, Serdar Yüksel

    Abstract: Non-stationarity is a fundamental challenge in multi-agent reinforcement learning (MARL), where agents update their behaviour as they learn. Many theoretical advances in MARL avoid the challenge of non-stationarity by coordinating the policy updates of agents in various ways, including synchronizing times at which agents are allowed to revise their policies. Synchronization enables analysis of man…

    Submitted 6 August, 2023; originally announced August 2023.

  4. arXiv:2308.01123  [pdf, other]

    eess.SY cs.RO

    Planar Friction Modelling with LuGre Dynamics and Limit Surfaces

    Authors: Gabriel Arslan Waltersson, Yiannis Karayiannidis

    Abstract: During planar motion, contact surfaces exhibit a coupling between tangential and rotational friction forces. This paper proposes planar friction models grounded in the LuGre model and limit surface theory. First, distributed planar extended state models are proposed and the Elasto-Plastic model is extended for multi-dimensional friction. Subsequently, we derive a reduced planar friction model, cou…

    Submitted 22 May, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted version

  5. arXiv:2209.05703  [pdf, other]

    cs.GT

    Independent Learning in Mean-Field Games: Satisficing Paths and Convergence to Subjective Equilibria

    Authors: Bora Yongacoglu, Gürdal Arslan, Serdar Yüksel

    Abstract: Independent learners are agents that employ single-agent algorithms in multi-agent systems, intentionally ignoring the effect of other strategic agents. This paper studies mean-field games from a decentralized learning perspective, with two primary objectives: (i) to identify structure that can guide algorithm design, and (ii) to understand the emergent behaviour in systems of independent learners…

    Submitted 23 November, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

  6. arXiv:2110.04638  [pdf, other]

    cs.GT cs.LG math.OC

    Satisficing Paths and Independent Multi-Agent Reinforcement Learning in Stochastic Games

    Authors: Bora Yongacoglu, Gürdal Arslan, Serdar Yüksel

    Abstract: In multi-agent reinforcement learning (MARL), independent learners are those that do not observe the actions of other agents in the system. Due to the decentralization of information, it is challenging to design independent learners that drive play to equilibrium. This paper investigates the feasibility of using satisficing dynamics to guide independent learners to approximate equilibrium in stoch…

    Submitted 19 February, 2023; v1 submitted 9 October, 2021; originally announced October 2021.

    Journal ref: SIAM Journal on Mathematics of Data Science, vol. 5, no. 3, pp. 745-773, Aug 2023

  7. arXiv:1506.07924  [pdf, ps, other]

    math.OC cs.GT cs.LG

    Decentralized Q-Learning for Stochastic Teams and Games

    Authors: Gürdal Arslan, Serdar Yüksel

    Abstract: There are only a few learning algorithms applicable to stochastic dynamic teams and games which generalize Markov decision processes to decentralized stochastic control problems involving possibly self-interested decision makers. Learning in games is generally difficult because of the non-stationary environment in which each decision maker aims to learn its optimal decisions with minimal informati…

    Submitted 2 May, 2016; v1 submitted 25 June, 2015; originally announced June 2015.

    Comments: To appear in IEEE Trans. Automatic Control