Skip to main content

Showing 1–11 of 11 results for author: Aitchison, M

.
  1. arXiv:2403.13793  [pdf, other

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah , et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2401.14953  [pdf, other

    cs.LG cs.AI

    Learning Universal Predictors

    Authors: Jordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Grégoire Delétang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison, Joel Veness

    Abstract: Meta-learning has emerged as a powerful approach to train neural networks to learn new tasks quickly from limited data. Broad exposure to different tasks leads to versatile representations enabling general problem solving. But, what are the limits of meta-learning? In this work, we explore the potential of amortizing the most powerful universal predictor, namely Solomonoff Induction (SI), into neu… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 32 pages, 11 figures

  3. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2312.07358  [pdf, other

    stat.ML cs.LG

    Distributional Bellman Operators over Mean Embeddings

    Authors: Li Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland

    Abstract: We propose a novel algorithmic framework for distributional reinforcement learning, based on learning finite-dimensional mean embeddings of return distributions. We derive several new algorithms for dynamic programming and temporal-difference learning based on this framework, provide asymptotic convergence theory, and examine the empirical performance of the algorithms on a suite of tabular tasks.… ▽ More

    Submitted 4 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  5. arXiv:2309.10668  [pdf, other

    cs.LG cs.AI cs.CL cs.IT

    Language Modeling Is Compression

    Authors: Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein, Christopher Mattern, Jordi Grau-Moya, Li Kevin Wenliang, Matthew Aitchison, Laurent Orseau, Marcus Hutter, Joel Veness

    Abstract: It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training increasingly large and powerful self-supervised (language) models. Since these large language models exhibit impressive predictive capabilities, they are well-positioned to be strong compressors. In th… ▽ More

    Submitted 18 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  6. arXiv:2210.02019  [pdf, other

    cs.AI cs.LG

    Atari-5: Distilling the Arcade Learning Environment down to Five Games

    Authors: Matthew Aitchison, Penny Sweetser, Marcus Hutter

    Abstract: The Arcade Learning Environment (ALE) has become an essential benchmark for assessing the performance of reinforcement learning algorithms. However, the computational cost of generating results on the entire 57-game dataset limits ALE's use and makes the reproducibility of many results infeasible. We propose a novel solution to this problem in the form of a principled methodology for selecting sma… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  7. Learning to Deceive in Multi-Agent Hidden Role Games

    Authors: Matthew Aitchison, Lyndon Benke, Penny Sweetser

    Abstract: Deception is prevalent in human social settings. However, studies into the effect of deception on reinforcement learning algorithms have been limited to simplistic settings, restricting their applicability to complex real-world problems. This paper addresses this by introducing a new mixed competitive-cooperative multi-agent reinforcement learning (MARL) environment inspired by popular role-based… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Journal ref: In: Deceptive AI. DeceptECAI DeceptAI 2020 2021. Communications in Computer and Information Science, vol 1296. Springer, Cham (2021)

  8. arXiv:2206.10027  [pdf, other

    cs.LG cs.AI

    DNA: Proximal Policy Optimization with a Dual Network Architecture

    Authors: Matthew Aitchison, Penny Sweetser

    Abstract: This paper explores the problem of simultaneously learning a value function and policy in deep actor-critic reinforcement learning models. We find that the common practice of learning these functions jointly is sub-optimal, due to an order-of-magnitude difference in noise levels between these two tasks. Instead, we show that learning these tasks independently, but with a constrained distillation p… ▽ More

    Submitted 13 November, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Published at NeurIPS 2022

  9. arXiv:2007.06615  [pdf

    cond-mat.mtrl-sci cond-mat.soft

    Impact of chemical structure on the dynamics of mass transfer of water in conjugated microporous polymers: A neutron spectroscopy study

    Authors: Anne A. Y. Guilbert, Yang Bai, Catherine M. Aitchison, Reiner S. Sprick, Mohamed Zbiri

    Abstract: Hydrogen fuel can contribute as a masterpiece in conceiving a robust carbon-free economic puzzle if cleaner methods to produce hydrogen become technically efficient and economically viable. Organic photocatalytic materials such as conjugated microporous materials (CMPs) are potential attractive candidates for water splitting as their energy levels and optical bandgap as well as porosity are tunabl… ▽ More

    Submitted 14 February, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Journal ref: ACS Appl. Polym. Mater. 3 (2021) 765-776

  10. arXiv:2006.15362  [pdf

    cond-mat.mtrl-sci cond-mat.soft

    Probing dynamics of water mass transfer in organic porous photocatalyst water-splitting materials by neutron spectroscopy

    Authors: Mohamed Zbiri, Catherine M. Aitchison, Reiner S. Sprick, Andrew I. Cooper, Anne A. Y. Guilbert

    Abstract: The quest for efficient and economically accessible cleaner methods to develop sustainable carbon-free energy sources induced a keen interest in the production of hydrogen fuel. This can be achieved via the water-splitting process exploiting solar energy but requiring the use of adequate photocatalysts. Covalent triazine-based frameworks (CTFs) are target photocatalysts for water-splitting. Both e… ▽ More

    Submitted 23 February, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Journal ref: Chemistry of Materials, 33 (2021) 1363

  11. arXiv:1906.09734  [pdf, other

    cs.LG stat.ML

    Optimal Use of Experience in First Person Shooter Environments

    Authors: Matthew Aitchison

    Abstract: Although reinforcement learning has made great strides recently, a continuing limitation is that it requires an extremely high number of interactions with the environment. In this paper, we explore the effectiveness of reusing experience from the experience replay buffer in the Deep Q-Learning algorithm. We test the effectiveness of applying learning update steps multiple times per environmental s… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.