Skip to main content

Showing 1–6 of 6 results for author: Tessera, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.08466  [pdf, other

    cs.AI

    Efficiently Quantifying Individual Agent Importance in Cooperative MARL

    Authors: Omayma Mahjoub, Ruan de Kock, Siddarth Singh, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Measuring the contribution of individual agents is challenging in cooperative multi-agent reinforcement learning (MARL). In cooperative MARL, team performance is typically inferred from a single shared global reward. Arguably, among the best current approaches to effectively measure individual agent contributions is to use Shapley values. However, calculating these values is expensive as the compu… ▽ More

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 8 pages, AAAI XAI4DRL workshop 2023; references updated, figure 8 style updated, typos

    MSC Class: I.2.11; I.2.0; A.0

  2. arXiv:2312.08463  [pdf, other

    cs.AI

    How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning

    Authors: Siddarth Singh, Omayma Mahjoub, Ruan de Kock, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Establishing sound experimental standards and rigour is important in any growing field of research. Deep Multi-Agent Reinforcement Learning (MARL) is one such nascent field. Although exciting progress has been made, MARL has recently come under scrutiny for replicability issues and a lack of standardised evaluation methodology, specifically in the cooperative setting. Although protocols have been… ▽ More

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 6 pages, AAAI XAI4DRL workshop 2023; typos corrected, images updated, page count updated

    MSC Class: I.2.11; I.2.0; A.0

  3. arXiv:2311.18598  [pdf, other

    cs.LG cs.AI cs.MA

    Generalisable Agents for Neural Network Optimisation

    Authors: Kale-ab Tessera, Callum Rhys Tilbury, Sasha Abramowitz, Ruan de Kock, Omayma Mahjoub, Benjamin Rosman, Sara Hooker, Arnu Pretorius

    Abstract: Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long training times. To address this difficulty, we propose the framework of Generalisable Agents for Neural Network Optimisation (GANNO) -- a multi-agent reinforcement learning (MARL) approach that learns to improve neural network optimisation by dynamically and responsivel… ▽ More

    Submitted 22 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Workshop on Advanced Neural Network Training (WANT) and Optimization for Machine Learning (OPT) at NeurIPS 2023

  4. arXiv:2304.00977  [pdf, other

    cs.AI cs.LG cs.MA

    Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning

    Authors: Claude Formanek, Callum Rhys Tilbury, Jonathan Shock, Kale-ab Tessera, Arnu Pretorius

    Abstract: 'Reincarnation' in reinforcement learning has been proposed as a formalisation of reusing prior computation from past experiments when training an agent in an environment. In this paper, we present a brief foray into the paradigm of reincarnation in the multi-agent (MA) context. We consider the case where only some agents are reincarnated, whereas the others are trained from scratch -- selective r… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: Accepted as oral presentation at Reincarnating Reinforcement Learning workshop at ICLR 2023

  5. arXiv:2111.03904  [pdf, other

    cs.LG stat.AP

    On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

    Authors: Ibrahim Salihu Yusuf, Kale-ab Tessera, Thomas Tumiel, Zohra Slim, Amine Kerkeni, Sella Nevo, Arnu Pretorius

    Abstract: Desert locust outbreaks threaten the food security of a large part of Africa and have affected the livelihoods of millions of people over the years. Machine learning (ML) has been demonstrated as an effective approach to locust distribution modelling which could assist in early warning. ML requires a significant amount of labelled data to train. Most publicly available labelled data on locusts are… ▽ More

    Submitted 20 May, 2022; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: AI for Humanitarian Assistance and Disaster Response (AI+HADR) workshop, NeurIPS 2021

  6. arXiv:2102.01670  [pdf, other

    cs.LG cs.CV

    Keep the Gradients Flowing: Using Gradient Flow to Study Sparse Network Optimization

    Authors: Kale-ab Tessera, Sara Hooker, Benjamin Rosman

    Abstract: Training sparse networks to converge to the same performance as dense neural architectures has proven to be elusive. Recent work suggests that initialization is the key. However, while this direction of research has had some success, focusing on initialization alone appears to be inadequate. In this paper, we take a broader view of training sparse networks and consider the role of regularization,… ▽ More

    Submitted 15 June, 2021; v1 submitted 2 February, 2021; originally announced February 2021.