Skip to main content

Showing 1–6 of 6 results for author: Barde, P

.
  1. arXiv:2305.17198  [pdf, other

    cs.LG cs.AI cs.MA

    A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem

    Authors: Paul Barde, Jakob Foerster, Derek Nowrouzezahrai, Amy Zhang

    Abstract: Training multiple agents to coordinate is an essential problem with applications in robotics, game theory, economics, and social sciences. However, most existing Multi-Agent Reinforcement Learning (MARL) methods are online and thus impractical for real-world applications in which collecting new interactions is costly or dangerous. While these algorithms should leverage offline data when available,… ▽ More

    Submitted 18 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  2. arXiv:2112.07342  [pdf, other

    cs.LG cs.AI cs.MA

    Learning to Guide and to Be Guided in the Architect-Builder Problem

    Authors: Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

    Abstract: We are interested in interactive agents that learn to coordinate, namely, a $builder$ -- which performs actions but ignores the goal of the task, i.e. has no access to rewards -- and an $architect$ which guides the builder towards the goal of the task. We define and explore a formal setting where artificial agents are equipped with mechanisms that allow them to simultaneously learn a task while at… ▽ More

    Submitted 11 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (2022)

  3. arXiv:2010.03691  [pdf, other

    cs.LG

    Regularized Inverse Reinforcement Learning

    Authors: Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

    Abstract: Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL applies strongly convex regularizers to the learner's policy in order to avoid the expert's behavior being rationalized by arbitrary constant rewards, also known as degenerate solutions. We propose tractable soluti… ▽ More

    Submitted 2 December, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: 26 pages, 7 figures

  4. arXiv:2006.13258  [pdf, other

    cs.LG cs.AI stat.ML

    Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

    Authors: Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

    Abstract: Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator. This alternated optimization is known to be delicate in practice since it compounds unstable adversarial training with brittle and sample-inefficient reinforcement learning.… ▽ More

    Submitted 16 April, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  5. arXiv:2002.10525  [pdf, other

    cs.MA cs.LG

    Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

    Authors: Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

    Abstract: Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a recent approach that applies single-agent AIRL to multi-agent problems where we seek to recover both policies for our agents and reward functions that promote expert-like behavior. While MA-AIRL has promising results on cooperative and competitive tasks, it is sample-inefficient and has only been validated empirically for small… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

  6. arXiv:1908.02269  [pdf, other

    cs.LG cs.MA stat.ML

    Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

    Authors: Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide t… ▽ More

    Submitted 9 November, 2020; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: 23 pages, 16 figures. This revised version contains additional results and minor edits