
Showing 1–50 of 59 results for author: Prorok, A

Searching in archive cs.
  1. arXiv:2406.13600

    cs.AI

    CoDreamer: Communication-Based Decentralised World Models

    Authors: Edan Toledo, Amanda Prorok

    Abstract: Sample efficiency is a critical challenge in reinforcement learning. Model-based RL has emerged as a solution, but its application has largely been confined to single-agent scenarios. In this work, we introduce CoDreamer, an extension of the Dreamer algorithm for multi-agent environments. CoDreamer leverages Graph Neural Networks for a two-level communication system to tackle challenges such as pa…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2405.15054

    cs.MA cs.AI cs.LG cs.RO

    Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning

    Authors: Matteo Bettini, Ryan Kortvelesy, Amanda Prorok

    Abstract: The study of behavioral diversity in Multi-Agent Reinforcement Learning (MARL) is a nascent yet promising field. In this context, the present work deals with the question of how to control the diversity of a multi-agent system. With no existing approaches to control diversity to a set value, current solutions focus on blindly promoting it via intrinsic rewards or additional loss functions, effecti…

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.02198

    cs.RO cs.MA eess.SY

    The Cambridge RoboMaster: An Agile Multi-Robot Research Platform

    Authors: Jan Blumenkamp, Ajay Shankar, Matteo Bettini, Joshua Bird, Amanda Prorok

    Abstract: Compact robotic platforms with powerful compute and actuation capabilities are key enablers for practical, real-world deployments of multi-agent research. This article introduces a tightly integrated hardware, control, and simulation software stack on a fleet of holonomic ground robot platforms designed with this motivation. Our robots, a fleet of customised DJI Robomaster S1 vehicles, offer a bal…

    Submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2405.01107

    cs.RO cs.MA eess.SY

    CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications

    Authors: Jan Blumenkamp, Steven Morad, Jennifer Gielis, Amanda Prorok

    Abstract: Autonomous robot operation in unstructured environments is often underpinned by spatial understanding through vision. Systems composed of multiple concurrently operating robots additionally require access to frequent, accurate and reliable pose estimates. Classical vision-based methods to regress relative pose are commonly computationally expensive (precluding real-time applications), and often la…

    Submitted 7 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  5. arXiv:2403.14583

    cs.RO cs.LG cs.MA

    Co-Optimization of Environment and Policies for Decentralized Multi-Agent Navigation

    Authors: Zhan Gao, Guang Yang, Amanda Prorok

    Abstract: This work views the multi-agent system and its surrounding environment as a co-evolving system, where the behavior of one affects the other. The goal is to take both agent actions and environment configurations as decision variables, and optimize these two components in a coordinated manner to improve some measure of interest. Towards this end, we consider the problem of decentralized multi-agent…

    Submitted 21 March, 2024; originally announced March 2024.

  6. arXiv:2403.06750

    cs.MA cs.LG cs.RO

    Generalising Multi-Agent Cooperation through Task-Agnostic Communication

    Authors: Dulhan Jayalath, Steven Morad, Amanda Prorok

    Abstract: Existing communication methods for multi-agent reinforcement learning (MARL) in cooperative multi-robot problems are almost exclusively task-specific, training new communication strategies for each unique task. We address this inefficiency by introducing a communication strategy applicable to any task within a given environment. We pre-train the communication strategy without task-specific reward…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 12 pages, 6 figures, submitted to Distributed Autonomous Robotic Systems (DARS 2024)

  7. arXiv:2402.09900

    cs.LG cs.AI

    Revisiting Recurrent Reinforcement Learning with Memory Monoids

    Authors: Steven Morad, Chris Lu, Ryan Kortvelesy, Stephan Liwicki, Jakob Foerster, Amanda Prorok

    Abstract: Memory models such as Recurrent Neural Networks (RNNs) and Transformers address Partially Observable Markov Decision Processes (POMDPs) by mapping trajectories to latent Markov states. Neither model scales particularly well to long sequences, especially compared to an emerging class of memory models sometimes called linear recurrent models. We discover that we can model the recurrent update of the…

    Submitted 17 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  8. arXiv:2312.03488

    cs.RO cs.MA

    Modeling Aggregate Downwash Forces for Dense Multirotor Flight

    Authors: Jennifer Gielis, Ajay Shankar, Ryan Kortvelesy, Amanda Prorok

    Abstract: Dense formation flight with multirotor swarms is a powerful, nature-inspired flight regime with numerous applications in the real world. However, when multirotors fly in close vertical proximity to each other, the propeller downwash from the vehicles can have a destabilising effect on each other. Unfortunately, even in a homogeneous team, an accurate model of downwash forces from one vehicle is unl…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Presented at International Symposium on Experimental Robotics (ISER) 2023

  9. arXiv:2312.02372

    eess.SP cs.LG

    On the Trade-Off between Stability and Representational Capacity in Graph Neural Networks

    Authors: Zhan Gao, Amanda Prorok, Elvin Isufi

    Abstract: Analyzing the stability of graph neural networks (GNNs) under topological perturbations is key to understanding their transferability and the role of each architecture component. However, stability has been investigated only for particular architectures, questioning whether it holds for a broader spectrum of GNNs or only for a few instances. To answer this question, we study the stability of EdgeN…

    Submitted 4 December, 2023; originally announced December 2023.

  10. arXiv:2312.01472

    cs.LG cs.AI cs.MA

    BenchMARL: Benchmarking Multi-Agent Reinforcement Learning

    Authors: Matteo Bettini, Amanda Prorok, Vincent Moens

    Abstract: The field of Multi-Agent Reinforcement Learning (MARL) is currently facing a reproducibility crisis. While solutions for standardized reporting have been proposed to address the issue, we still lack a benchmarking tool that enables standardization and reproducibility, while leveraging cutting-edge Reinforcement Learning (RL) implementations. In this paper, we introduce BenchMARL, the first MARL tr…

    Submitted 3 December, 2023; originally announced December 2023.

  11. arXiv:2311.13988

    cs.RO cs.LG eess.SY

    Docking Multirotors in Close Proximity using Learnt Downwash Models

    Authors: Ajay Shankar, Heedo Woo, Amanda Prorok

    Abstract: Unmodeled aerodynamic disturbances pose a key challenge for multirotor flight when multiple vehicles are in close proximity to each other. However, certain missions require two multirotors to approach each other within 1-2 body-lengths of each other and hold formation -- we consider one such practical instance: vertically docking two multirotors in the air. In this leader-follower setting…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Presented at International Symposium on Experimental Robotics (ISER) 2023

  12. arXiv:2310.04128

    cs.LG cs.AI

    Reinforcement Learning with Fast and Forgetful Memory

    Authors: Steven Morad, Ryan Kortvelesy, Stephan Liwicki, Amanda Prorok

    Abstract: Nearly all real world tasks are inherently partially observable, necessitating the use of memory in Reinforcement Learning (RL). Most model-free approaches summarize the trajectory into a latent Markov state using memory models borrowed from Supervised Learning (SL), even though RL tends to exhibit different training and efficiency characteristics. Addressing this discrepancy, we introduce Fast an…

    Submitted 6 October, 2023; originally announced October 2023.

  13. arXiv:2306.13892

    cs.LG cs.AI

    Differentially Private Decentralized Deep Learning with Consensus Algorithms

    Authors: Jasmine Bayrooti, Zhan Gao, Amanda Prorok

    Abstract: Cooperative decentralized deep learning relies on direct information exchange between communicating agents, each with access to a local dataset which should be kept private. The goal is for all agents to achieve consensus on model parameters after training. However, sharing parameters with untrustworthy neighboring agents could leak exploitable information about local datasets. To combat this, we…

    Submitted 24 June, 2023; originally announced June 2023.

  14. arXiv:2306.13826

    cs.LG

    Generalised f-Mean Aggregation for Graph Neural Networks

    Authors: Ryan Kortvelesy, Steven Morad, Amanda Prorok

    Abstract: Graph Neural Network (GNN) architectures are defined by their implementations of update and aggregation modules. While many works focus on new ways to parametrise the update modules, the aggregation modules receive comparatively little attention. Because it is difficult to parametrise aggregation functions, currently most methods select a "standard aggregator" such as $\mathrm{mean}$,…

    Submitted 10 October, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  15. arXiv:2305.18983

    cs.RO cs.AI cs.LG

    SO(2)-Equivariant Downwash Models for Close Proximity Flight

    Authors: H. Smith, A. Shankar, J. Gielis, J. Blumenkamp, A. Prorok

    Abstract: Multirotors flying in close proximity induce aerodynamic wake effects on each other through propeller downwash. Conventional methods have fallen short of providing adequate 3D force-based models that can be incorporated into robust control paradigms for deploying dense formations. Thus, learning a model for these downwash patterns presents an attractive solution. In this paper, we present a novel…

    Submitted 25 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Journal ref: Smith, H., Shankar, A., Gielis, J., Blumenkamp, J., & Prorok, A. IEEE Robotics and Automation Letters 9(2) (2024) 1174-1181

  16. arXiv:2305.11260

    eess.SY cs.LG cs.MA cs.RO

    Constrained Environment Optimization for Prioritized Multi-Agent Navigation

    Authors: Zhan Gao, Amanda Prorok

    Abstract: Traditional approaches to the design of multi-agent navigation algorithms consider the environment as a fixed constraint, despite the influence of spatial constraints on agents' performance. Yet hand-designing conducive environment layouts is inefficient and potentially expensive. The goal of this paper is to consider the environment as a decision variable in a system-level optimization problem, w…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2209.11279

  17. arXiv:2305.02128

    cs.MA cs.AI cs.LG cs.RO

    System Neural Diversity: Measuring Behavioral Heterogeneity in Multi-Agent Learning

    Authors: Matteo Bettini, Ajay Shankar, Amanda Prorok

    Abstract: Evolutionary science provides evidence that diversity confers resilience. Yet, traditional multi-agent reinforcement learning techniques commonly enforce homogeneity to increase training sample efficiency. When a system of learning agents is not constrained to homogeneous policies, individual agents may develop diverse behaviors, resulting in emergent complementarity that benefits the system. Desp…

    Submitted 3 May, 2023; originally announced May 2023.

  18. arXiv:2304.00790

    cs.RO eess.SY

    LQR-CBF-RRT*: Safe and Optimal Motion Planning

    Authors: Guang Yang, Mingyu Cai, Ahmad Ahmad, Amanda Prorok, Roberto Tron, Calin Belta

    Abstract: We present LQR-CBF-RRT*, an incremental sampling-based algorithm for offline motion planning. Our framework leverages the strength of Control Barrier Functions (CBFs) and Linear Quadratic Regulators (LQR) to generate safety-critical and optimal trajectories for a robot with dynamics described by an affine control system. CBFs are used for safety guarantees, while LQRs are employed for optimal cont…

    Submitted 27 September, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  19. arXiv:2303.04313

    cs.RO cs.LG cs.MA

    Online Control Barrier Functions for Decentralized Multi-Agent Navigation

    Authors: Zhan Gao, Guang Yang, Amanda Prorok

    Abstract: Control barrier functions (CBFs) enable guaranteed safe multi-agent navigation in the continuous domain. The resulting navigation performance, however, is highly sensitive to the underlying hyperparameters. Traditional approaches consider fixed CBFs (where parameters are tuned a priori), and hence, typically do not perform well in cluttered and highly dynamic environments: conservative parameter va…

    Submitted 8 September, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  20. arXiv:2303.01859

    cs.LG cs.AI cs.RO

    POPGym: Benchmarking Partially Observable Reinforcement Learning

    Authors: Steven Morad, Ryan Kortvelesy, Matteo Bettini, Stephan Liwicki, Amanda Prorok

    Abstract: Real world applications of Reinforcement Learning (RL) are often partially observable, thus requiring memory. Despite this, partial observability is still largely ignored by contemporary RL benchmarks and libraries. We introduce Partially Observable Process Gym (POPGym), a two-part library containing (1) a diverse collection of 15 partially observable environments, each with multiple difficulties…

    Submitted 3 March, 2023; originally announced March 2023.

  21. arXiv:2302.12826

    cs.LG cs.MA

    Permutation-Invariant Set Autoencoders with Fixed-Size Embeddings for Multi-Agent Learning

    Authors: Ryan Kortvelesy, Steven Morad, Amanda Prorok

    Abstract: The problem of permutation-invariant learning over set representations is particularly relevant in the field of multi-agent systems -- a few potential applications include unsupervised training of aggregation functions in graph neural networks (GNNs), neural cellular automata on graphs, and prediction of scenes with multiple objects. Yet existing approaches to set encoding and decoding tasks prese…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: AAMAS 2023

  22. arXiv:2301.08451

    cs.AI cs.LG cs.MA cs.RO

    Accelerating Multi-Agent Planning Using Graph Transformers with Bounded Suboptimality

    Authors: Chenning Yu, Qingbiao Li, Sicun Gao, Amanda Prorok

    Abstract: Conflict-Based Search is one of the most popular methods for multi-agent path finding. Though it is complete and optimal, it does not scale well. Recent works have been proposed to accelerate it by introducing various heuristics. However, whether these heuristics can apply to non-grid-based problem settings while maintaining their effectiveness remains an open question. In this work, we find that…

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted by ICRA 2023

  23. arXiv:2301.07137

    cs.RO cs.AI cs.LG cs.MA

    Heterogeneous Multi-Robot Reinforcement Learning

    Authors: Matteo Bettini, Ajay Shankar, Amanda Prorok

    Abstract: Cooperative multi-robot tasks can benefit from heterogeneity in the robots' physical and behavioral traits. In spite of this, traditional Multi-Agent Reinforcement Learning (MARL) frameworks lack the ability to explicitly accommodate policy heterogeneity, and typically constrain agents to share neural network parameters. This enforced homogeneity limits application in cases where the tasks benefit…

    Submitted 17 January, 2023; originally announced January 2023.

  24. arXiv:2210.16949

    cs.IT cs.LG

    Decentralized Channel Management in WLANs with Graph Neural Networks

    Authors: Zhan Gao, Yulin Shao, Deniz Gunduz, Amanda Prorok

    Abstract: Wireless local area networks (WLANs) manage multiple access points (APs) and assign scarce radio frequency resources to APs for satisfying traffic demands of associated user devices. This paper considers the channel allocation problem in WLANs that minimizes the mutual interference among APs, and puts forth a learning-based solution that can be implemented in a decentralized manner. We formulate t…

    Submitted 30 October, 2022; originally announced October 2022.

  25. arXiv:2209.11279

    cs.RO cs.LG cs.MA

    Environment Optimization for Multi-Agent Navigation

    Authors: Zhan Gao, Amanda Prorok

    Abstract: Traditional approaches to the design of multi-agent navigation algorithms consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance. Yet hand-designing improved environment layouts and structures is inefficient and potentially expensive. The goal of this paper is to consider the environment as a decision variable in a system-level o…

    Submitted 22 September, 2022; originally announced September 2022.

  26. arXiv:2208.00759

    cs.RO cs.LG cs.MA eess.SY

    See What the Robot Can't See: Learning Cooperative Perception for Visual Navigation

    Authors: Jan Blumenkamp, Qingbiao Li, Binyu Wang, Zhe Liu, Amanda Prorok

    Abstract: We consider the problem of navigating a mobile robot towards a target in an unknown environment that is endowed with visual sensors, where neither the robot nor the sensors have access to global positioning information and only use first-person-view images. In order to overcome the need for positioning, we train the sensors to encode and communicate relevant viewpoint information to the mobile rob…

    Submitted 31 July, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted to be presented at IROS 2023

  27. arXiv:2207.04505

    cs.MA cs.RO eess.SY

    On the properties of path additions for traffic routing

    Authors: Matteo Bettini, Amanda Prorok

    Abstract: In this paper we investigate the impact of path additions to transport networks with optimised traffic routing. In particular, we study the behaviour of total travel time, and consider both self-interested routing paradigms, such as User Equilibrium (UE) routing, as well as cooperative paradigms, such as classic Multi-Commodity (MC) network flow and System Optimal (SO) routing. We provide a formal…

    Submitted 10 July, 2022; originally announced July 2022.

  28. arXiv:2207.03530

    cs.RO cs.LG cs.MA

    VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning

    Authors: Matteo Bettini, Ryan Kortvelesy, Jan Blumenkamp, Amanda Prorok

    Abstract: While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to large-…

    Submitted 17 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  29. arXiv:2206.09484

    cs.RO cs.MA

    A Critical Review of Communications in Multi-Robot Systems

    Authors: Jennifer Gielis, Ajay Shankar, Amanda Prorok

    Abstract: Purpose of Review. This review summarizes the broad roles that communication formats and technologies have played in enabling multi-robot systems. We approach this field from two perspectives: of robotic applications that need communication capabilities in order to accomplish tasks, and of networking technologies that have enabled newer and more advanced multi-robot systems. Recent Findings. Thr…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 9 pages excl. bibliography, 2 figures

  30. arXiv:2205.13005

    cs.LG cs.MA

    QGNN: Value Function Factorisation with Graph Neural Networks

    Authors: Ryan Kortvelesy, Amanda Prorok

    Abstract: In multi-agent reinforcement learning, the use of a global objective is a powerful tool for incentivising cooperation. Unfortunately, it is not sample-efficient to train individual agents with a global reward, because it does not necessarily correlate with an agent's individual actions. This problem can be solved by factorising the global value function into local value functions. Early work in th…

    Submitted 20 June, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  31. arXiv:2111.01777

    cs.RO cs.LG cs.MA eess.SY

    A Framework for Real-World Multi-Robot Systems Running Decentralized GNN-Based Policies

    Authors: Jan Blumenkamp, Steven Morad, Jennifer Gielis, Qingbiao Li, Amanda Prorok

    Abstract: GNNs are a paradigm-shifting neural architecture to facilitate the learning of complex multi-agent behaviors. Recent work has demonstrated remarkable performance in tasks such as flocking, multi-agent path planning and cooperative coverage. However, the policies derived through GNN-based learning schemes have not yet been deployed to the real world on physical multi-robot systems. In this work, we…

    Submitted 28 February, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted at IEEE ICRA (International Conference on Robotics and Automation) 2022 Final Version (Camera Ready)

  32. arXiv:2110.05291

    cs.LG stat.ML

    Graph Neural Network Guided Local Search for the Traveling Salesperson Problem

    Authors: Benjamin Hudson, Qingbiao Li, Matthew Malencia, Amanda Prorok

    Abstract: Solutions to the Traveling Salesperson Problem (TSP) have practical applications to processes in transportation, logistics, and automation, yet must be computed with minimal delay to satisfy the real-time nature of the underlying tasks. However, solving large TSP instances quickly without sacrificing solution quality remains challenging for current approximate algorithms. To close this gap, we pre…

    Submitted 4 April, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

  33. arXiv:2109.14711

    cs.LG cs.AI cs.RO

    Explanation-Aware Experience Replay in Rule-Dense Environments

    Authors: Francesco Sovrano, Alex Raymond, Amanda Prorok

    Abstract: Human environments are often regulated by explicit and complex rulesets. Integrating Reinforcement Learning (RL) agents into such environments motivates the development of learning mechanisms that perform well in rule-dense and exception-ridden environments such as autonomous driving on regulated roads. In this paper, we propose a method for organising experience by means of partitioning the exper…

    Submitted 16 December, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE Robotics and Automation Letters (IEEE RA-L). Please cite the published version

    ACM Class: I.2.6; I.2.9

    Journal ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 2, April 2022)

  34. arXiv:2109.12343

    cs.RO cs.LG cs.MA eess.SY

    Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

    Authors: Amanda Prorok, Matthew Malencia, Luca Carlone, Gaurav S. Sukhatme, Brian M. Sadler, Vijay Kumar

    Abstract: Robustness is key to engineering, automation, and science as a whole. However, the property of robustness is often underpinned by costly requirements such as over-provisioning, known uncertainty and predictive models, and known adversaries. These conditions are idealistic, and often not satisfiable. Resilience on the other hand is the capability to endure unexpected disruptions, to recover swiftly…

    Submitted 25 September, 2021; originally announced September 2021.

  35. arXiv:2107.12254

    cs.RO cs.LG cs.MA

    The Holy Grail of Multi-Robot Planning: Learning to Generate Online-Scalable Solutions from Offline-Optimal Experts

    Authors: Amanda Prorok, Jan Blumenkamp, Qingbiao Li, Ryan Kortvelesy, Zhe Liu, Ethan Stump

    Abstract: Many multi-robot planning problems are burdened by the curse of dimensionality, which compounds the difficulty of applying solutions to large-scale problem instances. The use of learning-based methods in multi-robot planning holds great promise as it enables us to offload the online computational burden of expensive, yet optimal solvers, to an offline learning procedure. Simply put, the idea is to…

    Submitted 26 July, 2021; originally announced July 2021.

  36. Agree to Disagree: Subjective Fairness in Privacy-Restricted Decentralised Conflict Resolution

    Authors: Alex Raymond, Matthew Malencia, Guilherme Paulino-Passos, Amanda Prorok

    Abstract: Fairness is commonly seen as a property of the global outcome of a system and assumes centralisation and complete knowledge. However, in real decentralised applications, agents only have partial observation capabilities. Under limited information, agents rely on communication to divulge some of their private (and unobservable) information to others. When an agent deliberates to resolve conflicts,…

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: 25 pages, 8 figures

    ACM Class: I.2.11; I.2.3; I.2.9

    Journal ref: Frontiers in Robotics and AI, 2022

  37. arXiv:2106.14117

    cs.LG cs.AI cs.RO

    Graph Convolutional Memory using Topological Priors

    Authors: Steven D. Morad, Stephan Liwicki, Ryan Kortvelesy, Roberto Mecca, Amanda Prorok

    Abstract: Solving partially-observable Markov decision processes (POMDPs) is critical when applying reinforcement learning to real-world problems, where agents have an incomplete view of the world. We present graph convolutional memory (GCM), the first hybrid memory model for solving POMDPs using reinforcement learning. GCM uses either human-defined or data-driven topological priors to form graph neighborho…

    Submitted 8 October, 2021; v1 submitted 26 June, 2021; originally announced June 2021.

  38. arXiv:2105.08601

    cs.RO cs.AI cs.LG cs.MA

    Graph Neural Networks for Decentralized Multi-Robot Submodular Action Selection

    Authors: Lifeng Zhou, Vishnu D. Sharma, Qingbiao Li, Amanda Prorok, Alejandro Ribeiro, Pratap Tokekar, Vijay Kumar

    Abstract: The problem of decentralized multi-robot target tracking asks for jointly selecting actions, e.g., motion primitives, for the robots to maximize target tracking performance with local communications. One major challenge for practical implementations is to make target tracking approaches scalable for large-scale problem instances. In this work, we propose a general-purpose learning architecture tow…

    Submitted 14 September, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

  39. arXiv:2103.15660

    cs.RO

    Pursuer Assignment and Control Strategies in Multi-agent Pursuit-Evasion Under Uncertainties

    Authors: Leiming Zhang, Amanda Prorok, Subhrajit Bhattacharya

    Abstract: We consider a pursuit-evasion problem with a heterogeneous team of multiple pursuers and multiple evaders. Although both the pursuers (robots) and the evaders are aware of each other's control and assignment strategies, they do not have exact information about the other type of agents' location or action. Using only noisy on-board sensors the pursuers (or evaders) make probabilistic estimation of…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 14 pages, 6 figures

  40. arXiv:2103.13446

    cs.LG cs.MA cs.RO

    ModGNN: Expert Policy Approximation in Multi-Agent Systems with a Modular Graph Neural Network Architecture

    Authors: Ryan Kortvelesy, Amanda Prorok

    Abstract: Recent work in the multi-agent domain has shown the promise of Graph Neural Networks (GNNs) to learn complex coordination strategies. However, most current approaches use minor variants of a Graph Convolutional Network (GCN), which applies a convolution to the communication graph formed by the multi-agent system. In this paper, we investigate whether the performance and generalization of GCNs can…

    Submitted 24 February, 2023; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: ICRA 2021

  41. arXiv:2103.02142

    cs.RO cs.LG

    Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control

    Authors: Jacopo Panerati, Hehui Zheng, SiQi Zhou, James Xu, Amanda Prorok, Angela P. Schoellig

    Abstract: Robotic simulators are crucial for academic research and education as well as the development of safety-critical applications. Reinforcement learning environments -- simple simulations coupled with a problem specification in the form of a reward function -- are also important to standardize the development (and benchmarking) of learning algorithms. Yet, full-scale simulators typically lack portabi…

    Submitted 25 July, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: 8 pages, 11 figures, accepted for presentation at IROS 2021

    ACM Class: I.2.6; I.2.9

  42. arXiv:2102.06265

    cs.RO cs.MA

    Fair Robust Assignment using Redundancy

    Authors: Matthew Malencia, Vijay Kumar, George Pappas, Amanda Prorok

    Abstract: We study the consideration of fairness in redundant assignment for multi-agent task allocation. It has recently been shown that redundant assignment of agents to tasks provides robustness to uncertainty in task performance. However, the question of how to fairly assign these redundant resources across tasks remains unaddressed. In this paper, we present a novel problem formulation for fair redunda…

    Submitted 5 March, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: (c) 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  43. arXiv:2012.14906

    cs.LG eess.SP eess.SY

    Synthesizing Decentralized Controllers with Graph Neural Networks and Imitation Learning

    Authors: Fernando Gama, Qingbiao Li, Ekaterina Tolstaya, Amanda Prorok, Alejandro Ribeiro

    Abstract: Dynamical systems consisting of a set of autonomous agents face the challenge of having to accomplish a global task, relying only on local information. While centralized controllers are readily available, they face limitations in terms of scalability and implementation, as they do not respect the distributed information structure imposed by the network system of agents. Given the difficulties in f…

    Submitted 23 March, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

  44. arXiv:2012.00508  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Gaussian Process Based Message Filtering for Robust Multi-Agent Cooperation in the Presence of Adversarial Communication

    Authors: Rupert Mitchell, Jan Blumenkamp, Amanda Prorok

    Abstract: In this paper, we consider the problem of providing robustness to adversarial communication in multi-agent systems. Specifically, we propose a solution towards robust cooperation, which enables the multi-agent system to maintain high performance in the presence of anonymous non-cooperative agents that communicate faulty, misleading or manipulative information. In pursuit of this goal, we propose a…

    Submitted 1 December, 2020; originally announced December 2020.

  45. arXiv:2011.13219  [pdf, other]

    cs.RO cs.DC cs.LG cs.MA stat.ML

    Message-Aware Graph Attention Networks for Large-Scale Multi-Robot Path Planning

    Authors: Qingbiao Li, Weizhe Lin, Zhe Liu, Amanda Prorok

    Abstract: The domains of transport and logistics are increasingly relying on autonomous mobile robots for the handling and distribution of passengers or resources. At large system scales, finding decentralized path planning and coordination solutions is key to efficient system performance. Recently, Graph Neural Networks (GNNs) have become popular due to their ability to learn communication policies in dece…

    Submitted 25 April, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: This work has been accepted to the IEEE Robotics and Automation Letters (RA-L) for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  46. arXiv:2008.02616  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning

    Authors: Jan Blumenkamp, Amanda Prorok

    Abstract: Many real-world problems require the coordination of multiple autonomous agents. Recent work has shown the promise of Graph Neural Networks (GNNs) to learn explicit communication strategies that enable complex multi-agent coordination. These works use models of cooperative multi-agent systems whereby agents strive to achieve a shared global goal. When considering agents with self-interested local…

    Submitted 4 November, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted to Conference on Robot Learning (CoRL) 2020. Camera-ready version incorporating rebuttal

  47. arXiv:2005.05420  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Mobile Robot Path Planning in Dynamic Environments through Globally Guided Reinforcement Learning

    Authors: Binyu Wang, Zhe Liu, Qingbiao Li, Amanda Prorok

    Abstract: Path planning for mobile robots in large dynamic environments is a challenging problem, as the robots are required to efficiently reach their given goals while simultaneously avoiding potential conflicts with other robots or dynamic objects. In the presence of dynamic obstacles, traditional solutions usually employ re-planning strategies, which re-call a planning algorithm to search for an alterna…

    Submitted 11 September, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: 8 pages, 4 figures

  48. arXiv:2005.04995  [pdf, other]

    eess.SY cs.RO

    Effects of Controller Heterogeneity on Autonomous Vehicle Traffic

    Authors: Matthew Le Maitre, Amanda Prorok

    Abstract: Interactions between road users are both highly non-linear and profoundly complex, and there is no reason to expect that interactions between autonomous vehicles will be any different. Given the recent rapid development of autonomous vehicle technologies, we need to understand how these interactions are likely to present themselves, and what their implications might be. This paper looks into the i…

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: Accepted at ITSC 2020 (The 23rd IEEE International Conference on Intelligent Transportation Systems)

  49. arXiv:1912.06095  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Graph Neural Networks for Decentralized Multi-Robot Path Planning

    Authors: Qingbiao Li, Fernando Gama, Alejandro Ribeiro, Amanda Prorok

    Abstract: Effective communication is key to successful, decentralized, multi-robot path planning. Yet, it is far from obvious what information is crucial to the task at hand, and how and when it must be shared among robots. To side-step these issues and move beyond hand-crafted heuristics, we propose a combined model that automatically synthesizes local communication and decision-making policies for robots…

    Submitted 14 July, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: This paper has been accepted in the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020. For the simulation demo, see https://youtu.be/AGDk2RozpMQ

  50. arXiv:1911.11699  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving

    Authors: Rupert Mitchell, Jenny Fletcher, Jacopo Panerati, Amanda Prorok

    Abstract: Autonomous driving promises to transform road transport. Multi-vehicle and multi-lane scenarios, however, present unique challenges due to constrained navigation and unpredictable vehicle interactions. Learning-based methods, such as deep reinforcement learning, are emerging as a promising approach to automatically design intelligent driving policies that can cope with these challenges. Yet, the…

    Submitted 10 February, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: 10 pages, 8 figures

    ACM Class: I.2.6; I.2.9