Showing 1–29 of 29 results for author: Pretorius, A

  1. arXiv:2407.01343  [pdf, other]

    cs.LG cs.AI cs.MA

    Coordination Failure in Cooperative Offline MARL

    Authors: Callum Rhys Tilbury, Claude Formanek, Louise Beyers, Jonathan P. Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) leverages static datasets of experience to learn optimal multi-agent control. However, learning from static data presents several unique challenges to overcome. In this paper, we focus on coordination failure and investigate the role of joint actions in multi-agent policy gradients with offline data, focusing on a common setting we refer to as the…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted at the Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET) at the International Conference on Machine Learning, 2024

  2. arXiv:2406.16424  [pdf, other]

    cs.AI cs.LG

    Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization

    Authors: Felix Chalumeau, Refiloe Shabe, Noah de Nicola, Arnu Pretorius, Thomas D. Barrett, Nathan Grinsztajn

    Abstract: Combinatorial Optimization is crucial to numerous real-world applications, yet still presents challenges due to its (NP-)hard nature. Amongst existing approaches, heuristics often offer the best trade-off between quality and scalability, making them suitable for industrial use. While Reinforcement Learning (RL) offers a flexible framework for designing heuristics, its adoption over handcrafted heu…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.09068  [pdf, other]

    cs.LG cs.AI

    Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation

    Authors: Claude Formanek, Callum Rhys Tilbury, Louise Beyers, Jonathan Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) is an emerging field with great promise for real-world applications. Unfortunately, the current state of research in offline MARL is plagued by inconsistencies in baselines and evaluation protocols, which ultimately makes it difficult to accurately assess progress, trust newly proposed innovations, and allow researchers to easily build upon prior w…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2403.06860  [pdf, other]

    cs.LG cs.CV

    A Geospatial Approach to Predicting Desert Locust Breeding Grounds in Africa

    Authors: Ibrahim Salihu Yusuf, Mukhtar Opeyemi Yusuf, Kobby Panford-Quainoo, Arnu Pretorius

    Abstract: Desert locust swarms present a major threat to agriculture and food security. Addressing this challenge, our study develops an operationally-ready model for predicting locust breeding grounds, which has the potential to enhance early warning systems and targeted control measures. We curated a dataset from the United Nations Food and Agriculture Organization's (UN-FAO) locust observation records an…

    Submitted 21 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  5. arXiv:2312.08468  [pdf, other]

    cs.AI

    On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL

    Authors: Wiem Khlifi, Siddarth Singh, Omayma Mahjoub, Ruan de Kock, Abidine Vall, Rihab Gorsane, Arnu Pretorius

    Abstract: Cooperative multi-agent reinforcement learning (MARL) has made substantial strides in addressing the distributed decision-making challenges. However, as multi-agent systems grow in complexity, gaining a comprehensive understanding of their behaviour becomes increasingly challenging. Conventionally, tracking team rewards over time has served as a pragmatic measure to gauge the effectiveness of agen…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 4 pages, AAAI XAI4DRL workshop 2023

    MSC Class: I.2.11; I.2.0; A.0

  6. arXiv:2312.08466  [pdf, other]

    cs.AI

    Efficiently Quantifying Individual Agent Importance in Cooperative MARL

    Authors: Omayma Mahjoub, Ruan de Kock, Siddarth Singh, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Measuring the contribution of individual agents is challenging in cooperative multi-agent reinforcement learning (MARL). In cooperative MARL, team performance is typically inferred from a single shared global reward. Arguably, among the best current approaches to effectively measure individual agent contributions is to use Shapley values. However, calculating these values is expensive as the compu…

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 8 pages, AAAI XAI4DRL workshop 2023; references updated, figure 8 style updated, typos

    MSC Class: I.2.11; I.2.0; A.0

  7. arXiv:2312.08463  [pdf, other]

    cs.AI

    How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning

    Authors: Siddarth Singh, Omayma Mahjoub, Ruan de Kock, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Establishing sound experimental standards and rigour is important in any growing field of research. Deep Multi-Agent Reinforcement Learning (MARL) is one such nascent field. Although exciting progress has been made, MARL has recently come under scrutiny for replicability issues and a lack of standardised evaluation methodology, specifically in the cooperative setting. Although protocols have been…

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 6 pages, AAAI XAI4DRL workshop 2023; typos corrected, images updated, page count updated

    MSC Class: I.2.11; I.2.0; A.0

  8. arXiv:2311.18598  [pdf, other]

    cs.LG cs.AI cs.MA

    Generalisable Agents for Neural Network Optimisation

    Authors: Kale-ab Tessera, Callum Rhys Tilbury, Sasha Abramowitz, Ruan de Kock, Omayma Mahjoub, Benjamin Rosman, Sara Hooker, Arnu Pretorius

    Abstract: Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long training times. To address this difficulty, we propose the framework of Generalisable Agents for Neural Network Optimisation (GANNO) -- a multi-agent reinforcement learning (MARL) approach that learns to improve neural network optimisation by dynamically and responsivel…

    Submitted 22 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Workshop on Advanced Neural Network Training (WANT) and Optimization for Machine Learning (OPT) at NeurIPS 2023

  9. arXiv:2311.17371  [pdf, other]

    cs.CL cs.AI

    Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

    Authors: Andries Smit, Paul Duckworth, Nathan Grinsztajn, Thomas D. Barrett, Arnu Pretorius

    Abstract: Recent advancements in large language models (LLMs) underscore their potential for responding to inquiries in various domains. However, ensuring that generative agents provide accurate and reliable answers remains an ongoing challenge. In this context, multi-agent debate (MAD) has emerged as a promising strategy for enhancing the truthfulness of LLMs. We benchmark a range of debating and prompting…

    Submitted 14 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 2 pages, 13 figures

  10. arXiv:2311.13569  [pdf, other]

    cs.LG cs.AI

    Combinatorial Optimization with Policy Adaptation using Latent Space Search

    Authors: Felix Chalumeau, Shikha Surana, Clement Bonnet, Nathan Grinsztajn, Arnu Pretorius, Alexandre Laterre, Thomas D. Barrett

    Abstract: Combinatorial Optimization underpins many real-world applications and yet, designing performant algorithms to solve these complex, typically NP-hard, problems remains a significant research challenge. Reinforcement Learning (RL) provides a versatile framework for designing heuristics across a broad spectrum of problem domains. However, despite notable progress, RL has not yet supplanted industrial…

    Submitted 28 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Fix typo in formula and add a reference

  11. arXiv:2306.09884  [pdf, other]

    cs.LG cs.AI

    Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

    Authors: Clément Bonnet, Daniel Luo, Donal Byrne, Shikha Surana, Sasha Abramowitz, Paul Duckworth, Vincent Coyette, Laurence I. Midgley, Elshadai Tegegn, Tristan Kalloniatis, Omayma Mahjoub, Matthew Macfarlane, Andries P. Smit, Nathan Grinsztajn, Raphael Boige, Cemlyn N. Waters, Mohamed A. Mimouni, Ulrich A. Mbou Sob, Ruan de Kock, Siddarth Singh, Daniel Furelos-Blanco, Victor Le, Arnu Pretorius, Alexandre Laterre

    Abstract: Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments speci…

    Submitted 15 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 9 pages + 21 pages of appendices and references. Published at ICLR 2024

  12. arXiv:2304.00977  [pdf, other]

    cs.AI cs.LG cs.MA

    Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning

    Authors: Claude Formanek, Callum Rhys Tilbury, Jonathan Shock, Kale-ab Tessera, Arnu Pretorius

    Abstract: 'Reincarnation' in reinforcement learning has been proposed as a formalisation of reusing prior computation from past experiments when training an agent in an environment. In this paper, we present a brief foray into the paradigm of reincarnation in the multi-agent (MA) context. We consider the case where only some agents are reincarnated, whereas the others are trained from scratch -- selective r…

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: Accepted as oral presentation at Reincarnating Reinforcement Learning workshop at ICLR 2023

  13. arXiv:2302.00521  [pdf, other]

    cs.LG cs.AI cs.MA

    Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning

    Authors: Claude Formanek, Asad Jeewa, Jonathan Shock, Arnu Pretorius

    Abstract: Being able to harness the power of large datasets for developing cooperative multi-agent controllers promises to unlock enormous value for real-world applications. Many important industrial systems are multi-agent in nature and are difficult to model using bespoke simulators. However, in industry, distributed processes can often be recorded during operation, and large quantities of demonstrative d…

    Submitted 22 September, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Extended Abstract at Autonomous Agents and Multi-Agent Systems Conference 2023

  14. arXiv:2209.10485  [pdf, other]

    cs.LG cs.AI cs.GL cs.MA

    Towards a Standardised Performance Evaluation Protocol for Cooperative MARL

    Authors: Rihab Gorsane, Omayma Mahjoub, Ruan de Kock, Roland Dubb, Siddarth Singh, Arnu Pretorius

    Abstract: Multi-agent reinforcement learning (MARL) has emerged as a useful approach to solving decentralised decision-making problems at scale. Research in the field has been growing steadily with many breakthrough algorithms proposed in recent years. In this work, we take a closer look at this rapid development with a focus on evaluation methodologies employed across a large body of research in cooperativ…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Published at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). Website: see https://sites.google.com/view/marl-standard-protocol . 43 Pages, 21 Figures, 8 Tables

    ACM Class: I.2.11; I.2.0; A.1

  15. arXiv:2206.06758  [pdf, other]

    cs.MA cs.DM cs.LG

    Universally Expressive Communication in Multi-Agent Reinforcement Learning

    Authors: Matthew Morris, Thomas D. Barrett, Arnu Pretorius

    Abstract: Allowing agents to share information through communication is crucial for solving complex tasks in multi-agent reinforcement learning. In this work, we consider the question of whether a given communication protocol can express an arbitrary policy. By observing that many existing protocols can be viewed as instances of graph neural networks (GNNs), we demonstrate the equivalence of joint action se…

    Submitted 13 January, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Published in NeurIPS 2022

    MSC Class: 68T07; 68T42; 68R10 (Primary) 68T20; 05C15 (Secondary) ACM Class: I.2.11; I.2.6; I.2.8

  16. arXiv:2111.06721  [pdf, other]

    cs.LG cs.AI stat.ML

    Causal Multi-Agent Reinforcement Learning: Review and Open Problems

    Authors: St John Grimbly, Jonathan Shock, Arnu Pretorius

    Abstract: This paper serves to introduce the reader to the field of multi-agent reinforcement learning (MARL) and its intersection with methods from the study of causality. We highlight key challenges in MARL and discuss these in the context of how causal methods may assist in tackling them. We promote moving toward a 'causality first' perspective on MARL. Specifically, we argue that causality can offer imp…

    Submitted 1 December, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted at Cooperative AI Workshop, NeurIPS 2021

  17. arXiv:2111.03904  [pdf, other]

    cs.LG stat.AP

    On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

    Authors: Ibrahim Salihu Yusuf, Kale-ab Tessera, Thomas Tumiel, Zohra Slim, Amine Kerkeni, Sella Nevo, Arnu Pretorius

    Abstract: Desert locust outbreaks threaten the food security of a large part of Africa and have affected the livelihoods of millions of people over the years. Machine learning (ML) has been demonstrated as an effective approach to locust distribution modelling which could assist in early warning. ML requires a significant amount of labelled data to train. Most publicly available labelled data on locusts are…

    Submitted 20 May, 2022; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: AI for Humanitarian Assistance and Disaster Response (AI+HADR) workshop, NeurIPS 2021

  18. arXiv:2111.02827  [pdf, other]

    cs.CL cs.LG

    Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel

    Authors: Kevin Eloff, Okko Räsänen, Herman A. Engelbrecht, Arnu Pretorius, Herman Kamper

    Abstract: Multi-agent reinforcement learning has been used as an effective means to study emergent communication between agents, yet little focus has been given to continuous acoustic communication. This would be more akin to human language acquisition; human infants acquire language in large part through continuous signalling with their caregivers. We therefore ask: Are we able to observe emergent language…

    Submitted 2 May, 2023; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 10 pages, 3 figures, 6 tables

  19. arXiv:2110.05167  [pdf, other]

    stat.ML cs.LG

    Robust and Scalable SDE Learning: A Functional Perspective

    Authors: Scott Cameron, Tyron Cameron, Arnu Pretorius, Stephen Roberts

    Abstract: Stochastic differential equations provide a rich class of flexible generative models, capable of describing a wide range of spatio-temporal processes. A host of recent work looks to learn data-representing SDEs, using neural networks and other flexible function approximators. Despite these advances, learning remains computationally expensive due to the sequential nature of SDE integrators. In this…

    Submitted 11 October, 2021; originally announced October 2021.

  20. arXiv:2107.01460  [pdf, other]

    cs.LG cs.MA

    Mava: a research library for distributed multi-agent reinforcement learning in JAX

    Authors: Ruan de Kock, Omayma Mahjoub, Sasha Abramowitz, Wiem Khlifi, Callum Rhys Tilbury, Claude Formanek, Andries Smit, Arnu Pretorius

    Abstract: Multi-agent reinforcement learning (MARL) research is inherently computationally expensive and it is often difficult to obtain a sufficient number of experiment samples to test hypotheses and make robust statistical claims. Furthermore, MARL algorithms are typically complex in their design and can be tricky to implement correctly. These aspects of MARL present a difficult challenge when it comes t…

    Submitted 15 December, 2023; v1 submitted 3 July, 2021; originally announced July 2021.

  21. arXiv:2010.07777  [pdf, other]

    cs.LG cs.GT cs.MA

    A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

    Authors: Arnu Pretorius, Scott Cameron, Elan van Biljon, Tom Makkink, Shahil Mawjee, Jeremy du Plessis, Jonathan Shock, Alexandre Laterre, Karim Beguir

    Abstract: Multi-agent reinforcement learning has recently shown great promise as an approach to networked system control. Arguably, one of the most difficult and important tasks for which large scale networked system control is applicable is common-pool resource management. Crucial common-pool resources include arable land, fresh water, wetlands, wildlife, fish stock, forests and the atmosphere, of which pr…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 17 pages, 16 Figures, to appear in Advances of Neural Information Processing Systems (NeurIPS) conference, 2020

  22. arXiv:2004.04418  [pdf, other]

    cs.CL cs.LG

    On Optimal Transformer Depth for Low-Resource Language Translation

    Authors: Elan van Biljon, Arnu Pretorius, Julia Kreutzer

    Abstract: Transformers have shown great promise as an approach to Neural Machine Translation (NMT) for low-resource languages. However, at the same time, transformer models remain difficult to optimize and require careful tuning of hyper-parameters to be useful in this setting. Many NMT toolkits come with a set of default hyper-parameters, which researchers and practitioners often adopt for the sake of conv…

    Submitted 14 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

  23. arXiv:2001.06178  [pdf, other]

    cs.LG stat.ML

    DNNs as Layers of Cooperating Classifiers

    Authors: Marelie H. Davel, Marthinus W. Theunissen, Arnold M. Pretorius, Etienne Barnard

    Abstract: A robust theoretical framework that can describe and predict the generalization ability of deep neural networks (DNNs) in general circumstances remains elusive. Classical attempts have produced complexity metrics that rely heavily on global measures of compactness and capacity with little investigation into the effects of sub-component collaboration. We demonstrate intriguing regularities in the a…

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Accepted at AAAI-2020. The preprint contains additional figures and an appendix not included in the conference version. Main text remains unchanged

  24. arXiv:1910.10386  [pdf, other]

    cs.LG stat.ML

    Stabilising priors for robust Bayesian deep learning

    Authors: Felix McGregor, Arnu Pretorius, Johan du Preez, Steve Kroon

    Abstract: Bayesian neural networks (BNNs) have developed into useful tools for probabilistic modelling due to recent advances in variational inference enabling large scale BNNs. However, BNNs remain brittle and hard to train, especially: (1) when using deep architectures consisting of many hidden layers and (2) in situations with large weight variances. We use signal propagation theory to quantify these cha…

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 3 pages, accepted at Bayesian Deep learning workshop NeurIPS 2019

  25. arXiv:1910.05725  [pdf, other]

    stat.ML cs.LG

    If dropout limits trainable depth, does critical initialisation still matter? A large-scale statistical analysis on ReLU networks

    Authors: Arnu Pretorius, Elan van Biljon, Benjamin van Niekerk, Ryan Eloff, Matthew Reynard, Steve James, Benjamin Rosman, Herman Kamper, Steve Kroon

    Abstract: Recent work in signal propagation theory has shown that dropout limits the depth to which information can propagate through a neural network. In this paper, we investigate the effect of initialisation on training speed and generalisation for ReLU networks within this depth limit. We ask the following research question: given that critical initialisation is crucial for training at large depth, if d…

    Submitted 20 February, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, under consideration at Pattern Recognition Letters

  26. On the expected behaviour of noise regularised deep neural networks as Gaussian processes

    Authors: Arnu Pretorius, Herman Kamper, Steve Kroon

    Abstract: Recent work has established the equivalence between deep neural networks and Gaussian processes (GPs), resulting in so-called neural network Gaussian processes (NNGPs). The behaviour of these models depends on the initialisation of the corresponding network. In this work, we consider the impact of noise regularisation (e.g. dropout) on NNGPs, and relate their behaviour to signal propagation theory…

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, preliminary work

    Journal ref: Pattern Recognition Letters 138 (2020) 75-81

  27. arXiv:1904.07556  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks

    Authors: Ryan Eloff, André Nortje, Benjamin van Niekerk, Avashna Govender, Leanne Nortje, Arnu Pretorius, Elan van Biljon, Ewald van der Westhuizen, Lisa van Staden, Herman Kamper

    Abstract: For our submission to the ZeroSpeech 2019 challenge, we apply discrete latent-variable neural networks to unlabelled speech and use the discovered units for speech synthesis. Unsupervised discrete subword modelling could be useful for studies of phonetic category learning in infants or in low-resource speech technology requiring symbolic input. We use an autoencoder (AE) architecture with intermed…

    Submitted 28 June, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: Interspeech 2019

  28. arXiv:1811.00293  [pdf, other]

    stat.ML cs.LG

    Critical initialisation for deep signal propagation in noisy rectifier neural networks

    Authors: Arnu Pretorius, Elan Van Biljon, Steve Kroon, Herman Kamper

    Abstract: Stochastic regularisation is an important weapon in the arsenal of a deep learning practitioner. However, despite recent theoretical advances, our understanding of how noise influences signal propagation in deep neural networks remains limited. By extending recent work based on mean field theory, we develop a new framework for signal propagation in stochastic regularised neural networks. Our noisy…

    Submitted 30 November, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 20 pages, 11 figures, accepted at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018)

  29. arXiv:1806.05413  [pdf, other]

    stat.ML cs.LG

    Learning Dynamics of Linear Denoising Autoencoders

    Authors: Arnu Pretorius, Steve Kroon, Herman Kamper

    Abstract: Denoising autoencoders (DAEs) have proven useful for unsupervised representation learning, but a thorough theoretical understanding is still lacking of how the input noise influences learning. Here we develop theory for how noise influences learning in DAEs. By focusing on linear DAEs, we are able to derive analytic expressions that exactly describe their learning dynamics. We verify our theoretic…

    Submitted 29 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: 14 pages, 7 figures, accepted at the 35th International Conference on Machine Learning (ICML) 2018