Skip to main content

Showing 1–15 of 15 results for author: Ruoss, A

.
  1. arXiv:2403.13793  [pdf, other

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah , et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2402.04494  [pdf, other

    cs.LG cs.AI stat.ML

    Grandmaster-Level Chess Without Search

    Authors: Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Tim Genewein

    Abstract: The recent breakthrough successes in machine learning are mainly attributed to scale: namely large-scale attention-based architectures and datasets of unprecedented scale. This paper investigates the impact of training at scale for chess. Unlike traditional chess engines that rely on complex heuristics, explicit search, or a combination of both, we train a 270M parameter transformer model with sup… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  3. arXiv:2401.14953  [pdf, other

    cs.LG cs.AI

    Learning Universal Predictors

    Authors: Jordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Grégoire Delétang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison, Joel Veness

    Abstract: Meta-learning has emerged as a powerful approach to train neural networks to learn new tasks quickly from limited data. Broad exposure to different tasks leads to versatile representations enabling general problem solving. But, what are the limits of meta-learning? In this work, we explore the potential of amortizing the most powerful universal predictor, namely Solomonoff Induction (SI), into neu… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 32 pages, 11 figures

  4. arXiv:2312.07358  [pdf, other

    stat.ML cs.LG

    Distributional Bellman Operators over Mean Embeddings

    Authors: Li Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland

    Abstract: We propose a novel algorithmic framework for distributional reinforcement learning, based on learning finite-dimensional mean embeddings of return distributions. We derive several new algorithms for dynamic programming and temporal-difference learning based on this framework, provide asymptotic convergence theory, and examine the empirical performance of the algorithms on a suite of tabular tasks.… ▽ More

    Submitted 4 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  5. arXiv:2311.03583  [pdf, other

    cs.AI cs.DM cs.LG

    Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

    Authors: Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner

    Abstract: This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erdős, which aims to find graphs with a given size (number of nodes) that maximize the number of edges without having 3- or 4-cycles. We formulate this problem as a sequential decision-making problem and compare AlphaZero, a neural network-guided tree search, with tabu search, a heuristic local search method… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at MATH AI workshop at NeurIPS 2023, First three authors contributed equally, Last two authors have equal senior contribution

  6. arXiv:2309.10668  [pdf, other

    cs.LG cs.AI cs.CL cs.IT

    Language Modeling Is Compression

    Authors: Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein, Christopher Mattern, Jordi Grau-Moya, Li Kevin Wenliang, Matthew Aitchison, Laurent Orseau, Marcus Hutter, Joel Veness

    Abstract: It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training increasingly large and powerful self-supervised (language) models. Since these large language models exhibit impressive predictive capabilities, they are well-positioned to be strong compressors. In th… ▽ More

    Submitted 18 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  7. arXiv:2305.16843  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Randomized Positional Encodings Boost Length Generalization of Transformers

    Authors: Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness

    Abstract: Transformers have impressive generalization capabilities on tasks with a fixed context length. However, they fail to generalize to sequences of arbitrary length, even for seemingly simple tasks such as duplicating a string. Moreover, simply training on longer sequences is inefficient due to the quadratic computation complexity of the global attention mechanism. In this work, we demonstrate that th… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2302.03067  [pdf, other

    cs.LG cs.AI stat.ML

    Memory-Based Meta-Learning on Non-Stationary Distributions

    Authors: Tim Genewein, Grégoire Delétang, Anian Ruoss, Li Kevin Wenliang, Elliot Catt, Vincent Dutordoir, Jordi Grau-Moya, Laurent Orseau, Marcus Hutter, Joel Veness

    Abstract: Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work is to investigate how far this interpretation can be realized by current sequence prediction models and training regimes. The focus is on piecewise stationary s… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  9. arXiv:2209.15618  [pdf, other

    cs.AI cs.LG

    Beyond Bayes-optimality: meta-learning what you know you don't know

    Authors: Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Tim Genewein, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

    Abstract: Meta-training agents with memory has been shown to culminate in Bayes-optimal agents, which casts Bayes-optimality as the implicit solution to a numerical optimization problem rather than an explicit modeling assumption. Bayes-optimal agents are risk-neutral, since they solely attune to the expected return, and ambiguity-neutral, since they act in new situations as if the uncertainty were known. T… ▽ More

    Submitted 12 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 33 pages, 8 figures, technical report

  10. arXiv:2207.02098  [pdf, other

    cs.LG cs.AI cs.CL cs.FL

    Neural Networks and the Chomsky Hierarchy

    Authors: Grégoire Delétang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A. Ortega

    Abstract: Reliable generalization lies at the heart of safe ML and AI. However, understanding when and how neural networks generalize remains one of the most important unsolved problems in the field. In this work, we conduct an extensive empirical study (20'910 models, 15 tasks) to investigate whether insights from the theory of computation can predict the limits of neural network generalization in practice… ▽ More

    Submitted 28 February, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

  11. arXiv:2111.13650  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Latent Space Smoothing for Individually Fair Representations

    Authors: Momchil Peychev, Anian Ruoss, Mislav Balunović, Maximilian Baader, Martin Vechev

    Abstract: Fair representation learning transforms user data into a representation that ensures fairness and utility regardless of the downstream application. However, learning individually fair representations, i.e., guaranteeing that similar individuals are treated similarly, remains challenging in high-dimensional settings such as computer vision. In this work, we introduce LASSI, the first representation… ▽ More

    Submitted 26 July, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  12. arXiv:2106.05937  [pdf, other

    cs.LG cs.AI

    Fair Normalizing Flows

    Authors: Mislav Balunović, Anian Ruoss, Martin Vechev

    Abstract: Fair representation learning is an attractive approach that promises fairness of downstream predictors by encoding sensitive data. Unfortunately, recent work has shown that strong adversarial predictors can still exhibit unfairness by recovering sensitive attributes from these representations. In this work, we present Fair Normalizing Flows (FNF), a new approach offering more rigorous fairness gua… ▽ More

    Submitted 17 March, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

  13. arXiv:2103.16652  [pdf, other

    cs.LG cs.AI cs.CV

    Robustness Certification for Point Cloud Models

    Authors: Tobias Lorenz, Anian Ruoss, Mislav Balunović, Gagandeep Singh, Martin Vechev

    Abstract: The use of deep 3D point cloud models in safety-critical applications, such as autonomous driving, dictates the need to certify the robustness of these models to real-world transformations. This is technically challenging, as it requires a scalable verifier tailored to point cloud models that handles a wide range of semantic 3D transformations. In this work, we address this challenge and introduce… ▽ More

    Submitted 23 August, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: International Conference on Computer Vision (ICCV) 2021

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2021, pp. 7608-7618

  14. arXiv:2009.09318  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    Efficient Certification of Spatial Robustness

    Authors: Anian Ruoss, Maximilian Baader, Mislav Balunović, Martin Vechev

    Abstract: Recent work has exposed the vulnerability of computer vision models to vector field attacks. Due to the widespread usage of such models in safety-critical applications, it is crucial to quantify their robustness against such spatial transformations. However, existing work only provides empirical robustness quantification against vector field deformations via adversarial attacks, which lack provabl… ▽ More

    Submitted 30 January, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: Conference Paper at AAAI 2021

  15. arXiv:2002.10312  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Learning Certified Individually Fair Representations

    Authors: Anian Ruoss, Mislav Balunović, Marc Fischer, Martin Vechev

    Abstract: Fair representation learning provides an effective way of enforcing fairness constraints without compromising utility for downstream users. A desirable family of such fairness constraints, each requiring similar treatment for similar individuals, is known as individual fairness. In this work, we introduce the first method that enables data consumers to obtain certificates of individual fairness fo… ▽ More

    Submitted 28 November, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Conference Paper at NeurIPS 2020