Skip to main content

Showing 1–6 of 6 results for author: Black, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.07015  [pdf, other

    cs.LG cs.AI

    Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments

    Authors: Ryan Sullivan, J. K. Terry, Benjamin Black, John P. Dickerson

    Abstract: Visualizing optimization landscapes has led to many fundamental insights in numeric optimization, and novel improvements to optimization techniques. However, visualizations of the objective that reinforcement learning optimizes (the "reward surface") have only ever been generated for a small number of narrow contexts. This work presents reward surfaces and related visualizations of 27 of the most… ▽ More

    Submitted 21 September, 2022; v1 submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted at ICML 2022 Camera-Ready Version

  2. arXiv:2009.14471  [pdf, other

    cs.LG cs.MA stat.ML

    PettingZoo: Gym for Multi-Agent Reinforcement Learning

    Authors: J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodrigo Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi

    Abstract: This paper introduces the PettingZoo library and the accompanying Agent Environment Cycle ("AEC") games model. PettingZoo is a library of diverse sets of multi-agent environments with a universal, elegant Python API. PettingZoo was developed with the goal of accelerating research in Multi-Agent Reinforcement Learning ("MARL"), by making work more interchangeable, accessible and reproducible akin… ▽ More

    Submitted 26 October, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

  3. arXiv:2009.13051   

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Agent Environment Cycle Games

    Authors: J K Terry, Nathaniel Grammel, Benjamin Black, Ananth Hari, Caroline Horsch, Luis Santos

    Abstract: Partially Observable Stochastic Games (POSGs) are the most general and common model of games used in Multi-Agent Reinforcement Learning (MARL). We argue that the POSG model is conceptually ill suited to software MARL environments, and offer case studies from the literature where this mismatch has led to severely unexpected behavior. In response to this, we introduce the Agent Environment Cycle G… ▽ More

    Submitted 1 May, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: This work of this paper has been merged into the paper "PettingZoo: Gym for Multi-Agent Reinforcement Learning" arXiv:2009.14471

  4. arXiv:2009.09341  [pdf, other

    cs.LG cs.AI

    Multiplayer Support for the Arcade Learning Environment

    Authors: J. K. Terry, Benjamin Black, Luis Santos

    Abstract: The Arcade Learning Environment ("ALE") is a widely used library in the reinforcement learning community that allows easy programmatic interfacing with Atari 2600 games, via the Stella emulator. We introduce a publicly available extension to the ALE that extends its support to multiplayer games and game modes. This interface is additionally integrated with PettingZoo to allow for a simple Gym-li… ▽ More

    Submitted 17 January, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

  5. arXiv:2008.08932  [pdf, ps, other

    cs.LG cs.AI

    SuperSuit: Simple Microwrappers for Reinforcement Learning Environments

    Authors: J. K. Terry, Benjamin Black, Ananth Hari

    Abstract: In reinforcement learning, wrappers are universally used to transform the information that passes between a model and an environment. Despite their ubiquity, no library exists with reasonable implementations of all popular preprocessing methods. This leads to unnecessary bugs, code inefficiencies, and wasted developer time. Accordingly we introduce SuperSuit, a Python library that includes all p… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

  6. arXiv:2005.13625  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning

    Authors: J. K. Terry, Nathaniel Grammel, Sanghyun Son, Benjamin Black, Aakriti Agrawal

    Abstract: Parameter sharing, where each agent independently learns a policy with fully shared parameters between all policies, is a popular baseline method for multi-agent deep reinforcement learning. Unfortunately, since all agents share the same policy network, they cannot learn different policies or tasks. This issue has been circumvented experimentally by adding an agent-specific indicator signal to obs… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 May, 2020; originally announced May 2020.