Showing 1–2 of 2 results for author: Harding, S A

  1. arXiv:2102.03479

    cs.LG cs.AI cs.MA

    Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning

    Authors: Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao

    Abstract: Many complex multi-agent systems such as robot swarms control and autonomous vehicle coordination can be modeled as Multi-Agent Reinforcement Learning (MARL) tasks. QMIX, a widely popular MARL algorithm, has been used as a baseline for the benchmark environments, e.g., Starcraft Multi-Agent Challenge (SMAC), Difficulty-Enhanced Predator-Prey (DEPP). Recent variants of QMIX target relaxing the mono…

    Submitted 8 June, 2023; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Accepted by ICLR BlogTrack 2023

  2. arXiv:2009.04197   

    cs.LG cs.MA stat.ML

    QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

    Authors: Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

    Abstract: In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation and random sampling, the randomness in rewards and observations leads to randomness in long-term returns. Existing methods such as Value Decomposition Network…

    Submitted 23 February, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: There are some experimental errors and experimental unfairness in this paper that will seriously affect the later studies