Showing 1–50 of 75 results for author: Witt, C

  1. arXiv:2406.12137  [pdf, other]

    cs.AI

    IDs for AI Systems

    Authors: Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung

    Abstract: AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. A user may not be able to verify whether a system satisfies certain safety standards. An investigator may not know whom to investigate when a system causes an incident. A platform may find it difficult to penalize repeated negative interactions with the same s…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2406.04899  [pdf, ps, other]

    cs.NE cs.AI

    Sliding Window 3-Objective Pareto Optimization for Problems with Chance Constraints

    Authors: Frank Neumann, Carsten Witt

    Abstract: Constrained single-objective problems have been frequently tackled by evolutionary multi-objective algorithms where the constraint is relaxed into an additional objective. Recently, it has been shown that Pareto optimization approaches using bi-objective models can be significantly sped up using sliding windows (Neumann and Witt, ECAI 2023). In this paper, we extend the sliding window approach to…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: To appear at PPSN 2024

  3. arXiv:2406.02619  [pdf, other]

    cs.CR cs.LG

    Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits

    Authors: Andis Draguns, Andrew Gritsevskiy, Sumeet Ramesh Motwani, Charlie Rogers-Smith, Jeffrey Ladish, Christian Schroeder de Witt

    Abstract: The rapid proliferation of open-source language models significantly increases the risks of downstream backdoor attacks. These backdoors can introduce dangerous behaviours during model deployment and can evade detection by conventional cybersecurity monitoring systems. In this paper, we introduce a novel class of backdoors in autoregressive transformer models, that, in contrast to prior art, are u…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

  4. arXiv:2405.19540  [pdf, other]

    cs.IT cs.CR

    Computing Low-Entropy Couplings for Large-Support Distributions

    Authors: Samuel Sokota, Dylan Sam, Christian Schroeder de Witt, Spencer Compton, Jakob Foerster, J. Zico Kolter

    Abstract: Minimum-entropy coupling (MEC) -- the process of finding a joint distribution with minimum entropy for given marginals -- has applications in areas such as causality and steganography. However, existing algorithms are either computationally intractable for large-support distributions or limited to specific distribution types and sensitive to hyperparameter choices. This work addresses these limita…

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2404.17047  [pdf, other]

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i…

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  6. arXiv:2404.11239  [pdf, ps, other]

    cs.NE

    Runtime Analysis of a Multi-Valued Compact Genetic Algorithm on Generalized OneMax

    Authors: Sumit Adak, Carsten Witt

    Abstract: A class of metaheuristic techniques called estimation-of-distribution algorithms (EDAs) are employed in optimization as more sophisticated substitutes for traditional strategies like evolutionary algorithms. EDAs generally drive the search for the optimum by creating explicit probabilistic models of potential candidate solutions through repeated sampling and selection from the underlying search sp…

    Submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2404.07099  [pdf, other]

    cs.LG cs.AI

    Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection

    Authors: Linas Nasvytis, Kai Sandbrink, Jakob Foerster, Tim Franzmeyer, Christian Schroeder de Witt

    Abstract: While reinforcement learning (RL) algorithms have been successfully applied across numerous sequential decision-making problems, their generalization to unforeseen testing environments remains a significant concern. In this paper, we study the problem of out-of-distribution (OOD) detection in RL, which focuses on identifying situations at test time that RL agents have not encountered in their trai…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted as a full paper to the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)

  8. A Flexible Evolutionary Algorithm With Dynamic Mutation Rate Archive

    Authors: Martin S. Krejca, Carsten Witt

    Abstract: We propose a new, flexible approach for dynamically maintaining successful mutation rates in evolutionary algorithms using $k$-bit flip mutations. The algorithm adds successful mutation rates to an archive of promising rates that are favored in subsequent steps. Rates expire when their number of unsuccessful trials has exceeded a threshold, while rates currently not present in the archive can ente…

    Submitted 5 April, 2024; originally announced April 2024.

  9. arXiv:2402.07510  [pdf, other]

    cs.AI cs.CR

    Secret Collusion Among Generative AI Agents

    Authors: Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H. S. Torr, Lewis Hammond, Christian Schroeder de Witt

    Abstract: Recent capability increases in large language models (LLMs) open up applications in which teams of communicating generative AI agents solve joint tasks. This poses privacy and security challenges concerning the unauthorised sharing of information, or other unwanted forms of agent coordination. Modern steganographic techniques could render such dynamics hard to detect. In this paper, we comprehensi…

    Submitted 12 February, 2024; originally announced February 2024.

  10. arXiv:2402.01088  [pdf, other]

    cs.GT cs.MA

    The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games

    Authors: Jake Levi, Chris Lu, Timon Willi, Christian Schroeder de Witt, Jakob Foerster

    Abstract: The increasing prevalence of multi-agent learning systems in society necessitates understanding how to learn effective and safe policies in general-sum multi-agent environments against a variety of opponents, including self-play. General-sum learning is difficult because of non-stationary opponents and misaligned incentives. Our first main contribution is to show that many recent approaches to gen…

    Submitted 27 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 31 pages, 23 figures

  11. arXiv:2311.10090  [pdf, other]

    cs.LG cs.AI cs.MA

    JaxMARL: Multi-Agent RL Environments in JAX

    Authors: Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Gardar Ingvarsson, Timon Willi, Akbir Khan, Christian Schroeder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert Tjarko Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktaschel, Chris Lu, Jakob Nicolaus Foerster

    Abstract: Benchmarks play an important role in the development of machine learning algorithms. For example, research in reinforcement learning (RL) has been heavily influenced by available environments and benchmarks. However, RL environments are traditionally run on the CPU, limiting their scalability with typical academic compute. Recent advancements in JAX have enabled the wider use of hardware accelerat…

    Submitted 19 December, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

  12. arXiv:2308.13049  [pdf, other]

    cs.LG

    Bayesian Exploration Networks

    Authors: Mattie Fellows, Brandon Kaplowitz, Christian Schroeder de Witt, Shimon Whiteson

    Abstract: Bayesian reinforcement learning (RL) offers a principled and elegant approach for sequential decision making under uncertainty. Most notably, Bayesian agents do not face an exploration/exploitation dilemma, a major pathology of frequentist methods. However, theoretical understanding of model-free approaches is lacking. In this paper, we introduce a novel Bayesian model-free formulation and the firs…

    Submitted 25 June, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Typos fixed and provided clearer proof of Theorem 3.2

  13. First Steps Towards a Runtime Analysis of Neuroevolution

    Authors: Paul Fischer, Emil Lundt Larsen, Carsten Witt

    Abstract: We consider a simple setting in neuroevolution where an evolutionary algorithm optimizes the weights and activation functions of a simple artificial neural network. We then define simple example functions to be learned by the network and conduct rigorous runtime analyses for networks with a single neuron and for a more advanced structure with several neurons and two layers. Our results show that t…

    Submitted 16 October, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 27 pages; full version of paper published at FOGA 2023 and available at ACM

    Journal ref: FOGA 23, 2023, 61-72

  14. arXiv:2305.07178  [pdf, other]

    cs.NE cs.AI

    Fast Pareto Optimization Using Sliding Window Selection

    Authors: Frank Neumann, Carsten Witt

    Abstract: Pareto optimization using evolutionary multi-objective algorithms has been widely applied to solve constrained submodular optimization problems. A crucial factor determining the runtime of the used evolutionary algorithms to obtain good approximations is the population size of the algorithms which grows with the number of trade-offs that the algorithms encounter. In this paper, we introduce a slid…

    Submitted 11 May, 2023; originally announced May 2023.

  15. arXiv:2304.10848  [pdf, other]

    cs.NE cs.AI cs.DS

    How Well Does the Metropolis Algorithm Cope With Local Optima?

    Authors: Benjamin Doerr, Taha El Ghazi El Houssaini, Amirhossein Rajabi, Carsten Witt

    Abstract: The Metropolis algorithm (MA) is a classic stochastic local search heuristic. It avoids getting stuck in local optima by occasionally accepting inferior solutions. To better and in a rigorous manner understand this ability, we conduct a mathematical runtime analysis of the MA on the CLIFF benchmark. Apart from one local optimum, cliff functions are monotonically increasing towards the global optim…

    Submitted 15 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: To appear in the proceedings of GECCO 2023. With appendix containing all proofs. 28 pages

  16. arXiv:2304.08774  [pdf, ps, other]

    cs.NE cs.AI

    3-Objective Pareto Optimization for Problems with Chance Constraints

    Authors: Frank Neumann, Carsten Witt

    Abstract: Evolutionary multi-objective algorithms have successfully been used in the context of Pareto optimization where a given constraint is relaxed into an additional objective. In this paper, we explore the use of 3-objective formulations for problems with chance constraints. Our formulation trades off the expected cost and variance of the stochastic component as well as the given deterministic constra…

    Submitted 18 April, 2023; originally announced April 2023.

  17. arXiv:2303.10733  [pdf, other]

    cs.AI cs.MA

    Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

    Authors: Yat Long Lo, Christian Schroeder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson

    Abstract: By enabling agents to communicate, recent cooperative multi-agent reinforcement learning (MARL) methods have demonstrated better task performance and more coordinated behavior. Most existing approaches facilitate inter-agent communication by allowing agents to send messages to each other through free communication channels, i.e., cheap talk channels. Current methods require these channels to be co…

    Submitted 19 March, 2023; originally announced March 2023.

    Comments: The 11th International Conference on Learning Representations (ICLR)

  18. arXiv:2211.11043  [pdf, other]

    econ.GN cs.AI cs.LG

    Revealing Robust Oil and Gas Company Macro-Strategies using Deep Multi-Agent Reinforcement Learning

    Authors: Dylan Radovic, Lucas Kruitwagen, Christian Schroeder de Witt, Ben Caldecott, Shane Tomlinson, Mark Workman

    Abstract: The energy transition potentially poses an existential risk for major international oil companies (IOCs) if they fail to adapt to low-carbon business models. Projections of energy futures, however, are met with diverging assumptions on its scale and pace, causing disagreement among IOC decision-makers and their stakeholders over what the business model of an incumbent fossil fuel company should be…

    Submitted 20 November, 2022; originally announced November 2022.

  19. arXiv:2210.14889  [pdf, other]

    cs.CR cs.AI cs.MM

    Perfectly Secure Steganography Using Minimum Entropy Coupling

    Authors: Christian Schroeder de Witt, Samuel Sokota, J. Zico Kolter, Jakob Foerster, Martin Strohmeier

    Abstract: Steganography is the practice of encoding secret information into innocuous content in such a manner that an adversarial third party would not realize that there is hidden meaning. While this problem has classically been studied in security literature, recent advances in generative models have led to a shared interest among security and machine learning researchers in developing scalable steganogr…

    Submitted 30 October, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

  20. arXiv:2210.12124  [pdf, other]

    cs.LG

    Equivariant Networks for Zero-Shot Coordination

    Authors: Darius Muglich, Christian Schroeder de Witt, Elise van der Pol, Shimon Whiteson, Jakob Foerster

    Abstract: Successful coordination in Dec-POMDPs requires agents to adopt robust strategies and interpretable styles of play for their partner. A common failure mode is symmetry breaking, when agents arbitrarily converge on one out of many equivalent but mutually incompatible policies. Commonly these examples include partial observability, e.g. waving your right hand vs. left hand to convey a covert message.…

    Submitted 10 April, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

  21. arXiv:2210.05639  [pdf, other]

    cs.LG cs.AI

    Discovered Policy Optimisation

    Authors: Chris Lu, Jakub Grudzien Kuba, Alistair Letcher, Luke Metz, Christian Schroeder de Witt, Jakob Foerster

    Abstract: Tremendous progress has been made in reinforcement learning (RL) over the past decade. Most of these advancements came through the continual development of new algorithms, which were designed using a combination of mathematical derivations, intuitions, and experimentation. Such an approach of creating algorithms manually is limited by human understanding and ingenuity. In contrast, meta-learning p…

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  22. arXiv:2208.05670  [pdf, ps, other]

    cs.NE

    Runtime Analysis of the (1+1) EA on Weighted Sums of Transformed Linear Functions

    Authors: Frank Neumann, Carsten Witt

    Abstract: Linear functions play a key role in the runtime analysis of evolutionary algorithms and studies have provided a wide range of new insights and techniques for analyzing evolutionary computation methods. Motivated by studies on separable functions and the optimization behaviour of evolutionary algorithms as well as objective functions from the area of chance constrained optimization, we study the cl…

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: To appear at PPSN 2022. arXiv admin note: text overlap with arXiv:2109.05799

  23. arXiv:2207.10170  [pdf, other]

    cs.AI

    Illusory Attacks: Information-Theoretic Detectability Matters in Adversarial Attacks

    Authors: Tim Franzmeyer, Stephen McAleer, João F. Henriques, Jakob N. Foerster, Philip H. S. Torr, Adel Bibi, Christian Schroeder de Witt

    Abstract: Autonomous agents deployed in the real world need to be robust against adversarial attacks on sensory inputs. Robustifying agent policies requires anticipating the strongest attacks possible. We demonstrate that existing observation-space attacks on reinforcement learning agents have a common weakness: while effective, their lack of information-theoretic detectability constraints makes them detect…

    Submitted 6 May, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICLR 2024 Spotlight (top 5%)

  24. arXiv:2206.12765  [pdf, other]

    cs.AI cs.LG

    Generalized Beliefs for Cooperative AI

    Authors: Darius Muglich, Luisa Zintgraf, Christian Schroeder de Witt, Shimon Whiteson, Jakob Foerster

    Abstract: Self-play is a common paradigm for constructing solutions in Markov games that can yield optimal policies in collaborative settings. However, these policies often adopt highly-specialized conventions that make playing with a novel partner difficult. To address this, recent approaches rely on encoding symmetry and convention-awareness into policy training, but these require strong environmental ass…

    Submitted 25 June, 2022; originally announced June 2022.

  25. arXiv:2205.15311  [pdf, other]

    cs.NE physics.bio-ph

    Biological Evolution and Genetic Algorithms: Exploring the Space of Abstract Tile Self-Assembly

    Authors: Christian Schroeder de Witt

    Abstract: A physically-motivated genetic algorithm (GA) and full enumeration for a tile-based model of self-assembly (JaTAM) is implemented using a graphics processing unit (GPU). We observe performance gains with respect to state-of-the-art implementations on CPU of factor 7.7 for the GA and 2.9 for JaTAM. The correctness of our GA implementation is demonstrated using a test-bed fitness function, and our J…

    Submitted 28 May, 2022; originally announced May 2022.

    Comments: MPhys Thesis, 2012. Awarded University of Oxford Tessella Prize

  26. arXiv:2205.13281  [pdf, other]

    cs.CV

    Surround-view Fisheye Camera Perception for Automated Driving: Overview, Survey and Challenges

    Authors: Varun Ravi Kumar, Ciaran Eising, Christian Witt, Senthil Yogamani

    Abstract: Surround-view fisheye cameras are commonly used for near-field sensing in automated driving. Four fisheye cameras on four sides of the vehicle are sufficient to cover 360° around the vehicle capturing the entire near-field region. Some primary use cases are automated parking, traffic jam assist, and urban driving. There are limited datasets and very little work on near-field perception tasks as th…

    Submitted 5 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems

  27. arXiv:2205.01447  [pdf, other]

    cs.AI cs.MA

    Model-Free Opponent Shaping

    Authors: Chris Lu, Timon Willi, Christian Schroeder de Witt, Jakob Foerster

    Abstract: In general-sum games, the interaction of self-interested learning agents commonly leads to collectively worst-case outcomes, such as defect-defect in the iterated prisoner's dilemma (IPD). To overcome this, some methods, such as Learning with Opponent-Learning Awareness (LOLA), shape their opponents' learning process. However, these methods are myopic since only a small number of steps can be anti…

    Submitted 4 November, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: ICML 2022 camera ready version. Code: https://github.com/luchris429/Model-Free-Opponent-Shaping

  28. arXiv:2205.00666  [pdf, other]

    cs.CY econ.GN

    (Private)-Retroactive Carbon Pricing [(P)ReCaP]: A Market-based Approach for Climate Finance and Risk Assessment

    Authors: Yoshua Bengio, Prateek Gupta, Dylan Radovic, Maarten Scholl, Andrew Williams, Christian Schroeder de Witt, Tianyu Zhang, Yang Zhang

    Abstract: Insufficient Social Cost of Carbon (SCC) estimation methods and short-term decision-making horizons have hindered the ability of carbon emitters to properly correct for the negative externalities of climate change, as well as the capacity of nations to balance economic and climate policy. To overcome these limitations, we introduce Retrospective Social Cost of Carbon Updating (ReSCCU), a novel mec…

    Submitted 2 May, 2022; originally announced May 2022.

    MSC Class: 91B18 (Primary) 91B76; 91G40 (Secondary) ACM Class: J.4

  29. arXiv:2204.04904  [pdf, other]

    cs.NE

    The Compact Genetic Algorithm Struggles on Cliff Functions

    Authors: Frank Neumann, Dirk Sudholt, Carsten Witt

    Abstract: The compact genetic algorithm (cGA) is a non-elitist estimation of distribution algorithm which has been shown to be able to deal with difficult multimodal fitness landscapes that are hard to solve by elitist algorithms. In this paper, we investigate the cGA on the CLIFF function for which it has been shown recently that non-elitist evolutionary algorithms and artificial immune systems optimize it in…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: accepted at GECCO 2022

  30. Simulated Annealing is a Polynomial-Time Approximation Scheme for the Minimum Spanning Tree Problem

    Authors: Benjamin Doerr, Amirhossein Rajabi, Carsten Witt

    Abstract: We prove that Simulated Annealing with an appropriate cooling schedule computes arbitrarily tight constant-factor approximations to the minimum spanning tree problem in polynomial time. This result was conjectured by Wegener (2005). More precisely, denoting by $n, m, w_{\max}$, and $w_{\min}$ the number of vertices and edges as well as the maximum and minimum edge weight of the MST instance, we pr…

    Submitted 22 July, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: 19 pages. Extended version of a paper at GECCO 2022. This version is accepted for publication in Algorithmica

    Journal ref: Simulated annealing is a polynomial-time approximation scheme for the minimum spanning tree problem. Algorithmica. 2023

  31. arXiv:2201.02373  [pdf, other]

    cs.LG cs.AI

    Mirror Learning: A Unifying Framework of Policy Optimisation

    Authors: Jakub Grudzien Kuba, Christian Schroeder de Witt, Jakob Foerster

    Abstract: Modern deep reinforcement learning (RL) algorithms are motivated by either the generalised policy iteration (GPI) or trust-region learning (TRL) frameworks. However, algorithms that strictly respect these theoretical frameworks have proven unscalable. Surprisingly, the only known scalable algorithms violate the GPI/TRL assumptions, e.g. due to required regularisation or other heuristics. The curre…

    Submitted 14 July, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  32. arXiv:2111.12197  [pdf, other]

    cs.CR cs.AI

    Fixed Points in Cyber Space: Rethinking Optimal Evasion Attacks in the Age of AI-NIDS

    Authors: Christian Schroeder de Witt, Yongchao Huang, Philip H. S. Torr, Martin Strohmeier

    Abstract: Cyber attacks are increasing in volume, frequency, and complexity. In response, the security community is looking toward fully automating cyber defense systems using machine learning. However, so far the resultant effects on the coevolutionary dynamics of attackers and defenders have not been examined. In this whitepaper, we hypothesise that increased automation on both sides will accelerate the c…

    Submitted 23 November, 2021; originally announced November 2021.

  33. arXiv:2109.05799  [pdf, ps, other]

    cs.NE

    Runtime Analysis of Single- and Multi-Objective Evolutionary Algorithms for Chance Constrained Optimization Problems with Normally Distributed Random Variables

    Authors: Frank Neumann, Carsten Witt

    Abstract: Chance constrained optimization problems allow to model problems where constraints involving stochastic components should only be violated with a small probability. Evolutionary algorithms have been applied to this scenario and shown to achieve high quality results. With this paper, we contribute to the theoretical understanding of evolutionary algorithms for chance constrained optimization. We st…

    Submitted 9 August, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Conference version has been published at IJCAI 2022

  34. arXiv:2107.08295  [pdf, other]

    cs.AI cs.MA

    Communicating via Markov Decision Processes

    Authors: Samuel Sokota, Christian Schroeder de Witt, Maximilian Igl, Luisa Zintgraf, Philip Torr, Martin Strohmeier, J. Zico Kolter, Shimon Whiteson, Jakob Foerster

    Abstract: We consider the problem of communicating exogenous information by means of Markov decision process trajectories. This setting, which we call a Markov coding game (MCG), generalizes both source coding and a large class of referential games. MCGs also isolate a problem that is important in decentralized control settings in which cheap-talk is not available -- namely, they require balancing communica…

    Submitted 12 June, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: ICML 2022

  35. arXiv:2104.08492  [pdf, other]

    cs.AI cs.LG

    A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings

    Authors: Eltayeb Ahmed, Luisa Zintgraf, Christian A. Schroeder de Witt, Nicolas Usunier

    Abstract: In this work we explore an auxiliary loss useful for reinforcement learning in environments where strong performing agents are required to be able to navigate a spatial environment. The auxiliary loss proposed is to minimize the classification error of a neural network classifier that predicts whether or not a pair of states sampled from the agent's current episode trajectory are in order. The clas…

    Submitted 17 April, 2021; originally announced April 2021.

  36. arXiv:2104.04395  [pdf, other]

    cs.NE

    Stagnation Detection in Highly Multimodal Fitness Landscapes

    Authors: Amirhossein Rajabi, Carsten Witt

    Abstract: Stagnation detection has been proposed as a mechanism for randomized search heuristics to escape from local optima by automatically increasing the size of the neighborhood to find the so-called gap size, i.e., the distance to the next improvement. Its usefulness has mostly been considered in simple multimodal landscapes with few local optima that could be crossed one after another. In multimodal l…

    Submitted 22 April, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 28 pages. Full version of a paper appearing at GECCO 2021. arXiv admin note: text overlap with arXiv:2101.12054

  37. arXiv:2103.10394  [pdf, ps, other]

    cs.NE cs.AI

    On Steady-State Evolutionary Algorithms and Selective Pressure: Why Inverse Rank-Based Allocation of Reproductive Trials is Best

    Authors: Dogan Corus, Andrei Lissovoi, Pietro S. Oliveto, Carsten Witt

    Abstract: We analyse the impact of the selective pressure for the global optimisation capabilities of steady-state EAs. For the standard bimodal benchmark function TwoMax we rigorously prove that using uniform parent selection leads to exponential runtimes with high probability to locate both optima for the standard ($μ$+1) EA and ($μ$+1) RLS with any polynomial population sizes. On the other hand, we prov…

    Submitted 18 March, 2021; originally announced March 2021.

  38. arXiv:2102.07448  [pdf, other]

    cs.CV cs.RO

    OmniDet: Surround View Cameras based Multi-task Visual Perception Network for Autonomous Driving

    Authors: Varun Ravi Kumar, Senthil Yogamani, Hazem Rashed, Ganesh Sistu, Christian Witt, Isabelle Leang, Stefan Milz, Patrick Mäder

    Abstract: Surround View fisheye cameras are commonly deployed in automated driving for 360° near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentati…

    Submitted 6 June, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Best Robot Vision paper award finalist (top 4). Camera ready version accepted for RA-L and ICRA 2021 publication

  39. Stagnation Detection with Randomized Local Search

    Authors: Amirhossein Rajabi, Carsten Witt

    Abstract: Recently a mechanism called stagnation detection was proposed that automatically adjusts the mutation rate of evolutionary algorithms when they encounter local optima. The so-called $SD-(1+1)EA$ introduced by Rajabi and Witt (GECCO 2020) adds stagnation detection to the classical $(1+1)EA$ with standard bit mutation, which flips each bit independently with some mutation rate, and raises the mutati…

    Submitted 8 February, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 24 pages. Full version of a paper appearing at EvoCOP 2021

  40. arXiv:2012.09670  [pdf, other]

    cs.LG cs.AI physics.ao-ph

    RainBench: Towards Global Precipitation Forecasting from Satellite Imagery

    Authors: Christian Schroeder de Witt, Catherine Tong, Valentina Zantedeschi, Daniele De Martini, Freddie Kalaitzis, Matthew Chantry, Duncan Watson-Parris, Piotr Bilinski

    Abstract: Extreme precipitation events, such as violent rainfall and hail storms, routinely ravage economies and livelihoods around the developing world. Climate change further aggravates this issue. Data-driven deep learning approaches could widen the access to accurate multi-day forecasts, to mitigate against such events. However, there is currently no benchmark dataset dedicated to the study of global pr…

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: Work completed during the 2020 Frontier Development Lab research accelerator, a private-public partnership with NASA in the US, and ESA in Europe. Accepted as a spotlight/long oral talk at both Climate Change and AI, as well as AI for Earth Sciences Workshops at NeurIPS 2020

  41. arXiv:2011.09533  [pdf, other]

    cs.AI

    Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

    Authors: Christian Schroeder de Witt, Tarun Gupta, Denys Makoviichuk, Viktor Makoviychuk, Philip H. S. Torr, Mingfei Sun, Shimon Whiteson

    Abstract: Most recently developed approaches to cooperative multi-agent reinforcement learning in the \emph{centralized training with decentralized execution} setting involve estimating a centralized, joint value function. In this paper, we demonstrate that, despite its various theoretical shortcomings, Independent PPO (IPPO), a form of independent learning in which each agent simply estimates its local val…

    Submitted 18 November, 2020; originally announced November 2020.

  42. arXiv:2010.10885  [pdf, other]

    cs.NE cs.AI

    Improved Runtime Results for Simple Randomised Search Heuristics on Linear Functions with a Uniform Constraint

    Authors: Frank Neumann, Mojgan Pourhassan, Carsten Witt

    Abstract: In the last decade remarkable progress has been made in development of suitable proof techniques for analysing randomised search heuristics. The theoretical investigation of these algorithms on classes of functions is essential to the understanding of the underlying stochastic process. Linear functions have been traditionally studied in this area resulting in tight bounds on the expected optimisat…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Journal version to appear in Algorithmica

  43. arXiv:2007.06676  [pdf, other]

    cs.CV cs.LG cs.RO

    UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models

    Authors: Varun Ravi Kumar, Senthil Yogamani, Markus Bach, Christian Witt, Stefan Milz, Patrick Mader

    Abstract: In classical computer vision, rectification is an integral part of multi-view depth estimation. It typically includes epipolar rectification and lens distortion correction. This process simplifies the depth estimation significantly, and thus it has been adopted in CNN approaches. However, rectification has several side effects, including a reduced field of view (FOV), resampling distortion, and se…

    Submitted 6 June, 2023; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Minor fixes added after IROS 2020 Camera ready submission. IROS 2020 presentation video - https://www.youtube.com/watch?v=3Br2KSWZRrY

  44. Evolutionary Algorithms with Self-adjusting Asymmetric Mutation

    Authors: Amirhossein Rajabi, Carsten Witt

    Abstract: Evolutionary Algorithms (EAs) and other randomized search heuristics are often considered as unbiased algorithms that are invariant with respect to different transformations of the underlying search space. However, if a certain amount of domain knowledge is available the use of biased search operators in EAs becomes viable. We consider a simple (1+1) EA for binary search spaces and analyze an asym…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 16 pages. An extended abstract of this paper will be published in the proceedings of PPSN 2020

  45. arXiv:2006.07019  [pdf, ps, other]

    cs.NE math.PR

    Improved Fixed-Budget Results via Drift Analysis

    Authors: Timo Kötzing, Carsten Witt

    Abstract: Fixed-budget theory is concerned with computing or bounding the fitness value achievable by randomized search heuristics within a given budget of fitness function evaluations. Despite recent progress in fixed-budget theory, there is a lack of general tools to derive such results. We transfer drift theory, the key tool to derive expected optimization times, to the fixed-budget perspective. A first…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 25 pages. An extended abstract of this paper will be published in the proceedings of PPSN 2020

  46. arXiv:2006.04222  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

    Authors: Shariq Iqbal, Christian A. Schroeder de Witt, Bei Peng, Wendelin Böhmer, Shimon Whiteson, Fei Sha

    Abstract: Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: ``What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?…

    Submitted 11 June, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: ICML 2021 Camera Ready

  47. arXiv:2005.07062  [pdf, other]

    cs.LG stat.AP stat.ML

    Simulation-Based Inference for Global Health Decisions

    Authors: Christian Schroeder de Witt, Bradley Gram-Hansen, Nantas Nardelli, Andrew Gambardella, Rob Zinkov, Puneet Dokania, N. Siddharth, Ana Belen Espinosa-Gonzalez, Ara Darzi, Philip Torr, Atılım Güneş Baydin

    Abstract: The COVID-19 pandemic has highlighted the importance of in-silico epidemiological modelling in predicting the dynamics of infectious diseases to inform health policy and decision makers about suitable prevention and containment strategies. Work in this setting involves solving challenging inference and control problems in individual-based models of ever increasing complexity. Here we discuss recen…

    Submitted 14 May, 2020; originally announced May 2020.

    Journal ref: ICML Workshop on Machine Learning for Global Health, Thirty-Seventh International Conference on Machine Learning (ICML 2020)

  48. Self-Adjusting Evolutionary Algorithms for Multimodal Optimization

    Authors: Amirhossein Rajabi, Carsten Witt

    Abstract: Recent theoretical research has shown that self-adjusting and self-adaptive mechanisms can provably outperform static settings in evolutionary algorithms for binary search spaces. However, the vast majority of these studies focuses on unimodal functions which do not require the algorithm to flip several bits simultaneously to make progress. In fact, existing self-adjusting algorithms are not desig…

    Submitted 2 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 26 pages. Full version of a paper appearing at GECCO 2020

  49. arXiv:2003.08839  [pdf, other]

    cs.LG cs.MA stat.ML

    Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

    Authors: Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

    Abstract: In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is available and communication constraints are lifted. Learning joint action-values conditioned on extra state information is an attractive way to exploit centralised l…

    Submitted 27 August, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Extended version of the ICML 2018 conference paper (arXiv:1803.11485)

    Journal ref: Journal of Machine Learning Research 21(178):1-51, 2020

  50. arXiv:2003.06709  [pdf, other]

    cs.LG cs.AI stat.ML

    FACMAC: Factored Multi-Agent Centralised Policy Gradients

    Authors: Bei Peng, Tabish Rashid, Christian A. Schroeder de Witt, Pierre-Alexandre Kamienny, Philip H. S. Torr, Wendelin Böhmer, Shimon Whiteson

    Abstract: We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method, our approach uses deep deterministic policy gradients to learn policies. However, FACMAC learns a centralised but factored critic, which combines per-agent utilit…

    Submitted 7 May, 2021; v1 submitted 14 March, 2020; originally announced March 2020.