Showing 1–10 of 10 results for author: Letcher, A

  1. arXiv:2309.12681  [pdf, other]

    quant-ph

    Tight and Efficient Gradient Bounds for Parameterized Quantum Circuits

    Authors: Alistair Letcher, Stefan Woerner, Christa Zoufal

    Abstract: The training of a parameterized model largely depends on the landscape of the underlying loss function. In particular, vanishing gradients (also known as barren plateaus) are a central bottleneck in the scalability of variational quantum algorithms (VQAs), and are known to arise in various ways, from circuit depth and hardware noise to global observables. However, a caveat of most existing gradien…

    Submitted 15 February, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  2. arXiv:2211.11030  [pdf, other]

    cs.LG cs.AI cs.CR

    Adversarial Cheap Talk

    Authors: Chris Lu, Timon Willi, Alistair Letcher, Jakob Foerster

    Abstract: Adversarial attacks in reinforcement learning (RL) often assume highly-privileged access to the victim's parameters, environment, or data. Instead, this paper proposes a novel adversarial setting called a Cheap Talk MDP in which an Adversary can merely append deterministic messages to the Victim's observation, resulting in a minimal range of influence. The Adversary cannot occlude ground truth, in…

    Submitted 11 July, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: To be published at ICML 2023. Project video and code are available at https://sites.google.com/view/adversarial-cheap-talk

  3. arXiv:2210.05639  [pdf, other]

    cs.LG cs.AI

    Discovered Policy Optimisation

    Authors: Chris Lu, Jakub Grudzien Kuba, Alistair Letcher, Luke Metz, Christian Schroeder de Witt, Jakob Foerster

    Abstract: Tremendous progress has been made in reinforcement learning (RL) over the past decade. Most of these advancements came through the continual development of new algorithms, which were designed using a combination of mathematical derivations, intuitions, and experimentation. Such an approach of creating algorithms manually is limited by human understanding and ingenuity. In contrast, meta-learning p…

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  4. arXiv:2203.04098  [pdf, other]

    cs.LG cs.AI cs.GT

    COLA: Consistent Learning with Opponent-Learning Awareness

    Authors: Timon Willi, Alistair Letcher, Johannes Treutlein, Jakob Foerster

    Abstract: Learning in general-sum games is unstable and frequently leads to socially undesirable (Pareto-dominated) outcomes. To mitigate this, Learning with Opponent-Learning Awareness (LOLA) introduced opponent shaping to this setting, by accounting for each agent's influence on their opponents' anticipated learning steps. However, the original LOLA formulation (and follow-up work) is inconsistent because…

    Submitted 27 June, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted @ ICML 2022

  5. arXiv:2111.08565  [pdf, other]

    cs.LG cs.MA math.OC

    Polymatrix Competitive Gradient Descent

    Authors: Jeffrey Ma, Alistair Letcher, Florian Schäfer, Yuanyuan Shi, Anima Anandkumar

    Abstract: Many economic games and machine learning approaches can be cast as competitive optimization problems where multiple agents are minimizing their respective objective function, which depends on all agents' actions. While gradient descent is a reliable basic workhorse for single-agent optimization, it often leads to oscillation in competitive optimization. In this work we propose polymatrix competiti…

    Submitted 16 November, 2021; originally announced November 2021.

  6. arXiv:2011.06505  [pdf, other]

    cs.LG

    Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian

    Authors: Jack Parker-Holder, Luke Metz, Cinjon Resnick, Hengyuan Hu, Adam Lerer, Alistair Letcher, Alex Peysakhovich, Aldo Pacchiano, Jakob Foerster

    Abstract: Over the last decade, a single algorithm has changed many facets of our lives - Stochastic Gradient Descent (SGD). In the era of ever decreasing loss functions, SGD and its various offspring have become the go-to optimization tool in machine learning and are a key component of the success of deep neural networks (DNNs). While SGD is guaranteed to converge to a local optimum (under loose assumption…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Camera-ready version, NeurIPS 2020

  7. arXiv:2005.12649  [pdf, other]

    math.OC cs.GT cs.LG cs.MA

    On the Impossibility of Global Convergence in Multi-Loss Optimization

    Authors: Alistair Letcher

    Abstract: Under mild regularity conditions, gradient-based methods converge globally to a critical point in the single-loss setting. This is known to break down for vanilla gradient descent when moving to multi-loss optimization, but can we hope to build some algorithm with global guarantees? We negatively resolve this open problem by proving that desirable convergence properties cannot simultaneously hold…

    Submitted 17 January, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 26 pages, 3 figures

  8. arXiv:1905.04926  [pdf, other]

    cs.LG cs.GT cs.MA cs.NE stat.ML

    Differentiable Game Mechanics

    Authors: Alistair Letcher, David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, Thore Graepel

    Abstract: Deep learning is built on the foundational guarantee that gradient descent on an objective function converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, that exhibit multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objecti…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: JMLR 2019, journal version of arXiv:1802.05642

    Journal ref: Journal of Machine Learning Research (JMLR), v20 (84) 1-40, 2019

  9. arXiv:1811.08469  [pdf, other]

    cs.MA cs.AI cs.LG

    Stable Opponent Shaping in Differentiable Games

    Authors: Alistair Letcher, Jakob Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson

    Abstract: A growing number of learning methods are actually differentiable games whose players optimise multiple, interdependent objectives in parallel -- from GANs and intrinsic curiosity to multi-agent RL. Opponent shaping is a powerful approach to improve learning dynamics in these games, accounting for player influence on others' updates. Learning with Opponent-Learning Awareness (LOLA) is a recent algo…

    Submitted 17 January, 2021; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: 20 pages, 7 figures

  10. arXiv:1711.05355  [pdf, other]

    eess.AS cs.SD stat.ML

    Automatic Conflict Detection in Police Body-Worn Audio

    Authors: Alistair Letcher, Jelena Trišović, Collin Cademartori, Xi Chen, Jason Xu

    Abstract: Automatic conflict detection has grown in relevance with the advent of body-worn technology, but existing metrics such as turn-taking and overlap are poor indicators of conflict in police-public interactions. Moreover, standard techniques to compute them fall short when applied to such diversified and noisy contexts. We develop a pipeline catered to this task combining adaptive noise removal, non-…

    Submitted 14 February, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: 5 pages, 2 figures, 1 table