Showing 1–1 of 1 results for author: Keval, K P

  1. arXiv:2311.12613  [pdf, other]

    eess.SY cs.LG cs.MA

    Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion

    Authors: Keshav P. Keval, Vivek S. Borkar

    Abstract: In this paper, we propose a reinforcement learning algorithm to solve a multi-agent Markov decision process (MMDP). The goal, inspired by Blackwell's Approachability Theorem, is to lower the time average cost of each agent to below a pre-specified agent-specific bound. For the MMDP, we assume the state dynamics to be controlled by the joint actions of agents, but the per-stage costs to only depend…

    Submitted 21 November, 2023; originally announced November 2023.