Skip to main content

Showing 1–50 of 51 results for author: Legg, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. arXiv:2311.02462  [pdf, ps, other

    cs.AI

    Levels of AGI for Operationalizing Progress on the Path to AGI

    Authors: Meredith Ringel Morris, Jascha Sohl-dickstein, Noah Fiedel, Tris Warkentin, Allan Dafoe, Aleksandra Faust, Clement Farabet, Shane Legg

    Abstract: We propose a framework for classifying the capabilities and behavior of Artificial General Intelligence (AGI) models and their precursors. This framework introduces levels of AGI performance, generality, and autonomy, providing a common language to compare models, assess risks, and measure progress along the path to AGI. To develop our framework, we analyze existing definitions of AGI, and distill… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: version 4 - Position Paper accepted to ICML 2024. Note that due to ICML position paper titling format requirements, the title has changed slightly from that of the original arXiv pre-print. The original pre-print title was "Levels of AGI: Operationalizing Progress on the Path to AGI" but the official published title for ICML 2024 is "Levels of AGI for Operationalizing Progress on the Path to AGI"

    Journal ref: Proceedings of ICML 2024

  3. arXiv:2307.15771  [pdf, other

    cs.LG cs.AI cs.CL

    The Hydra Effect: Emergent Self-repair in Language Model Computations

    Authors: Thomas McGrath, Matthew Rahtz, Janos Kramar, Vladimir Mikulik, Shane Legg

    Abstract: We investigate the internal structure of language model computations using causal analysis and demonstrate two motifs: (1) a form of adaptive computation where ablations of one attention layer of a language model cause another layer to compensate (which we term the Hydra effect) and (2) a counterbalancing function of late MLP layers that act to downregulate the maximum-likelihood token. Our ablati… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  4. arXiv:2305.16843  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Randomized Positional Encodings Boost Length Generalization of Transformers

    Authors: Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness

    Abstract: Transformers have impressive generalization capabilities on tasks with a fixed context length. However, they fail to generalize to sequences of arbitrary length, even for seemingly simple tasks such as duplicating a string. Moreover, simply training on longer sequences is inefficient due to the quadratic computation complexity of the global attention mechanism. In this work, we demonstrate that th… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  5. arXiv:2209.15618  [pdf, other

    cs.AI cs.LG

    Beyond Bayes-optimality: meta-learning what you know you don't know

    Authors: Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Tim Genewein, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

    Abstract: Meta-training agents with memory has been shown to culminate in Bayes-optimal agents, which casts Bayes-optimality as the implicit solution to a numerical optimization problem rather than an explicit modeling assumption. Bayes-optimal agents are risk-neutral, since they solely attune to the expected return, and ambiguity-neutral, since they act in new situations as if the uncertainty were known. T… ▽ More

    Submitted 12 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 33 pages, 8 figures, technical report

  6. arXiv:2207.02098  [pdf, other

    cs.LG cs.AI cs.CL cs.FL

    Neural Networks and the Chomsky Hierarchy

    Authors: Grégoire Delétang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A. Ortega

    Abstract: Reliable generalization lies at the heart of safe ML and AI. However, understanding when and how neural networks generalize remains one of the most important unsolved problems in the field. In this work, we conduct an extensive empirical study (20'910 models, 15 tasks) to investigate whether insights from the theory of computation can predict the limits of neural network generalization in practice… ▽ More

    Submitted 28 February, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

  7. arXiv:2203.12592  [pdf, other

    cs.LG stat.ML

    Your Policy Regularizer is Secretly an Adversary

    Authors: Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Shane Legg, Pedro Ortega

    Abstract: Policy regularization methods such as maximum entropy regularization are widely used in reinforcement learning to improve the robustness of a learned policy. In this paper, we show how this robustness arises from hedging against worst-case perturbations of the reward function, which are chosen from a limited set by an imagined adversary. Using convex duality, we characterize this robust set of adv… ▽ More

    Submitted 8 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Transactions on Machine Learning Research

    Journal ref: TMLR (2022) https://openreview.net/forum?id=berNQMTYWZ

  8. arXiv:2201.08102  [pdf, other

    cs.LG

    Safe Deep RL in 3D Environments using Human Feedback

    Authors: Matthew Rahtz, Vikrant Varma, Ramana Kumar, Zachary Kenton, Shane Legg, Jan Leike

    Abstract: Agents should avoid unsafe behaviour during both training and deployment. This typically requires a simulator and a procedural specification of unsafe behaviour. Unfortunately, a simulator is not always available, and procedurally specifying constraints can be difficult or impossible for many real-world tasks. A recently introduced technique, ReQueST, aims to solve this problem by learning a neura… ▽ More

    Submitted 21 January, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  9. arXiv:2111.02907  [pdf, other

    cs.LG

    Model-Free Risk-Sensitive Reinforcement Learning

    Authors: Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega

    Abstract: We extend temporal-difference (TD) learning in order to obtain risk-sensitive, model-free reinforcement learning algorithms. This extension can be regarded as modification of the Rescorla-Wagner rule, where the (sigmoidal) stimulus is taken to be either the event of over- or underestimating the TD target. As a result, one obtains a stochastic approximation rule for estimating the free energy from… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: DeepMind Tech Report: 13 pages, 4 figures

  10. arXiv:2110.10819  [pdf, other

    cs.LG cs.AI

    Shaking the foundations: delusions in sequence models for interaction and control

    Authors: Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

    Abstract: The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive however is purposeful adaptive behavior. Currently there is a common perception that sequence models "lack the understanding of the cause and effect of… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: DeepMind Tech Report, 16 pages, 4 figures

  11. arXiv:2103.03938  [pdf, other

    cs.AI cs.LG

    Causal Analysis of Agent Behavior for AI Safety

    Authors: Grégoire Déletang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

    Abstract: As machine learning systems become more powerful they also become increasingly unpredictable and opaque. Yet, finding human-understandable explanations of how they work is essential for their safe deployment. This technical report illustrates a methodology for investigating the causal mechanisms that drive the behaviour of artificial agents. Six use cases are covered, each addressing a typical que… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: 16 pages, 16 figures, 6 tables

  12. arXiv:2102.01685  [pdf, ps, other

    cs.AI cs.LG

    Agent Incentives: A Causal Perspective

    Authors: Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg

    Abstract: We present a framework for analysing agent incentives using causal influence diagrams. We establish that a well-known criterion for value of information is complete. We propose a new graphical criterion for value of control, establishing its soundness and completeness. We also introduce two new concepts for incentive analysis: response incentives indicate which changes in the environment affect an… ▽ More

    Submitted 15 March, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: In Proceedings of the AAAI 2021 Conference. Supersedes arXiv:1902.09980, arXiv:2001.07118

  13. arXiv:2011.08827  [pdf, other

    cs.LG cs.AI

    Avoiding Tampering Incentives in Deep RL via Decoupled Approval

    Authors: Jonathan Uesato, Ramana Kumar, Victoria Krakovna, Tom Everitt, Richard Ngo, Shane Legg

    Abstract: How can we design agents that pursue a given objective when all feedback mechanisms are influenceable by the agent? Standard RL algorithms assume a secure reward function, and can thus perform poorly in settings where agents can tamper with the reward-generating mechanism. We present a principled solution to the problem of learning from influenceable feedback, which combines approval with a decoup… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  14. arXiv:2011.08820  [pdf, other

    cs.LG cs.AI

    REALab: An Embedded Perspective on Tampering

    Authors: Ramana Kumar, Jonathan Uesato, Richard Ngo, Tom Everitt, Victoria Krakovna, Shane Legg

    Abstract: This paper describes REALab, a platform for embedded agency research in reinforcement learning (RL). REALab is designed to model the structure of tampering problems that may arise in real-world deployments of RL. Standard Markov Decision Process (MDP) formulations of RL and simulated environments mirroring the MDP structure assume secure access to feedback (e.g., rewards). This may be unrealistic… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  15. arXiv:2010.12237  [pdf, other

    cs.AI cs.LG

    Algorithms for Causal Reasoning in Probability Trees

    Authors: Tim Genewein, Tom McGrath, Grégoire Déletang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

    Abstract: Probability trees are one of the simplest models of causal generative processes. They possess clean semantics and -- unlike causal Bayesian networks -- they can represent context-specific causal dependencies, which are necessary for e.g. causal induction. Yet, they have received little attention from the AI and ML community. Here we present concrete algorithms for causal reasoning in discrete prob… ▽ More

    Submitted 11 November, 2020; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: (2nd version with correction to algorithm) 11 pages, 8 figures, 5 algorithms. A companion Colaboratory tutorial is available at https://github.com/deepmind/deepmind-research/tree/master/causal_reasoning

  16. arXiv:2010.11223  [pdf, other

    cs.AI cs.LG cs.NE

    Meta-trained agents implement Bayes-optimal agents

    Authors: Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

    Abstract: Memory-based meta-learning is a powerful technique to build agents that adapt fast to any task within a target distribution. A previous theoretical study has argued that this remarkable performance is because the meta-training protocol incentivises agents to behave Bayes-optimally. We empirically investigate this claim on a number of prediction and bandit tasks. Inspired by ideas from theoretical… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Published at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  17. arXiv:2010.07877  [pdf, other

    cs.LG cs.AI

    Avoiding Side Effects By Considering Future Tasks

    Authors: Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg

    Abstract: Designing reward functions is difficult: the designer has to specify what to do (what it means to complete the task) as well as what not to do (side effects that should be avoided while completing the task). To alleviate the burden on the reward designer, we propose an algorithm to automatically generate an auxiliary reward function that penalizes side effects. This auxiliary objective rewards the… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Published in NeurIPS 2020

  18. arXiv:2006.13900  [pdf, other

    cs.LG cs.AI stat.ML

    Quantifying Differences in Reward Functions

    Authors: Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike

    Abstract: For many tasks, the reward function is inaccessible to introspection or too complex to be specified procedurally, and must instead be learned from user data. Prior work has evaluated learned reward functions by evaluating policies optimized for the learned reward. However, this method cannot distinguish between the learned reward function failing to reflect user preferences and the policy optimiza… ▽ More

    Submitted 17 March, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021. 9 pages main paper, 42 pages total

    ACM Class: I.2.6

  19. arXiv:2004.13654  [pdf, other

    cs.AI

    Pitfalls of learning a reward function online

    Authors: Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg

    Abstract: In some agent designs like inverse reinforcement learning an agent needs to learn its own reward function. Learning the reward function and optimising for it are typically two different processes, usually performed at different stages. We consider a continual (``one life'') learning approach where the agent both learns the reward function and optimises for it at the same time. We show that this co… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  20. arXiv:2001.07118  [pdf, ps, other

    cs.AI cs.LG

    The Incentives that Shape Behaviour

    Authors: Ryan Carey, Eric Langlois, Tom Everitt, Shane Legg

    Abstract: Which variables does an agent have an incentive to control with its decision, and which variables does it have an incentive to respond to? We formalise these incentives, and demonstrate unique graphical criteria for detecting them in any single decision causal influence diagram. To this end, we introduce structural causal influence models, a hybrid of the influence diagram and structural causal mo… ▽ More

    Submitted 15 March, 2021; v1 submitted 20 January, 2020; originally announced January 2020.

    Comments: In SafeAI workshop at AAAI. Superseded by arXiv:2102.01685

    ACM Class: I.2.6; I.2.8

  21. arXiv:1912.05652  [pdf, other

    cs.CY cs.LG stat.ML

    Learning Human Objectives by Evaluating Hypothetical Behavior

    Authors: Siddharth Reddy, Anca D. Dragan, Sergey Levine, Shane Legg, Jan Leike

    Abstract: We seek to align agent behavior with a user's objectives in a reinforcement learning setting with unknown dynamics, an unknown reward function, and unknown unsafe states. The user knows the rewards and unsafe states, but querying the user is expensive. To address this challenge, we propose an algorithm that safely and interactively learns a model of the user's reward function. We start with a gene… ▽ More

    Submitted 24 March, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Published at International Conference on Machine Learning (ICML) 2020

  22. arXiv:1906.08663  [pdf, other

    cs.AI

    Modeling AGI Safety Frameworks with Causal Influence Diagrams

    Authors: Tom Everitt, Ramana Kumar, Victoria Krakovna, Shane Legg

    Abstract: Proposals for safe AGI systems are typically made at the level of frameworks, specifying how the components of the proposed system should be trained and interact with each other. In this paper, we model and compare the most promising AGI safety frameworks using causal influence diagrams. The diagrams show the optimization objective and causal assumptions of the framework. The unified representatio… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: IJCAI 2019 AI Safety Workshop

  23. arXiv:1905.03030  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-learning of Sequential Strategies

    Authors: Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

    Abstract: In this report we review memory-based meta-learning as a tool for building sample-efficient strategies that learn from past experience to adapt to any task within a target class. Our goal is to equip the reader with the conceptual foundations of this tool for building new, scalable agents that operate on broad domains. To do so, we present basic algorithmic templates for building near-optimal pred… ▽ More

    Submitted 18 July, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: DeepMind Technical Report (15 pages, 6 figures). Version V1.1

  24. arXiv:1902.09980  [pdf, ps, other

    cs.AI cs.LG

    Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings

    Authors: Tom Everitt, Pedro A. Ortega, Elizabeth Barnes, Shane Legg

    Abstract: Agents are systems that optimize an objective function in an environment. Together, the goal and the environment induce secondary objectives, incentives. Modeling the agent-environment interaction using causal influence diagrams, we can answer two fundamental questions about an agent's incentives directly from the graph: (1) which nodes can the agent have an incentivize to observe, and (2) which n… ▽ More

    Submitted 20 January, 2022; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Mostly superseded by arXiv:2102.01685

    ACM Class: I.2.6; I.2.8

  25. arXiv:1901.02230  [pdf, other

    cs.LG stat.ML

    Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

    Authors: Laurent Orseau, Tor Lattimore, Shane Legg

    Abstract: We consider prediction with expert advice under the log-loss with the goal of deriving efficient and robust algorithms. We argue that existing algorithms such as exponentiated gradient, online gradient descent and online Newton step do not adequately satisfy both requirements. Our main contribution is an analysis of the Prod algorithm that is robust to any data sequence and runs in linear time rel… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Journal ref: Algorithmic Learning Theory 2017

  26. arXiv:1812.05979  [pdf, ps, other

    cs.LG cs.CR cs.NE

    Scaling shared model governance via model splitting

    Authors: Miljan Martic, Jan Leike, Andrew Trask, Matteo Hessel, Shane Legg, Pushmeet Kohli

    Abstract: Currently the only techniques for sharing governance of a deep learning model are homomorphic encryption and secure multiparty computation. Unfortunately, neither of these techniques is applicable to the training of large neural networks due to their large computational and communication overheads. As a scalable technique for shared model governance, we propose splitting deep learning model betwee… ▽ More

    Submitted 14 December, 2018; originally announced December 2018.

    Comments: 9 pages

  27. arXiv:1811.07871  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Scalable agent alignment via reward modeling: a research direction

    Authors: Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg

    Abstract: One obstacle to applying reinforcement learning algorithms to real-world problems is the lack of suitable reward functions. Designing such reward functions is difficult in part because the user only has an implicit understanding of the task objective. This gives rise to the agent alignment problem: how do we create agents that behave in accordance with the user's intentions? We outline a high-leve… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  28. arXiv:1811.06521  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Reward learning from human preferences and demonstrations in Atari

    Authors: Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

    Abstract: To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions. Instead, we can have humans communicate an objective to the agent directly. In this work, we combine two approaches to learning from human feedback: expert demonstrations and trajectory preferences. We train a deep neural network to model the reward function and use its predicte… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  29. arXiv:1807.00196  [pdf, other

    cs.AI

    Modeling Friends and Foes

    Authors: Pedro A. Ortega, Shane Legg

    Abstract: How can one detect friendly and adversarial behavior from raw data? Detecting whether an environment is a friend, a foe, or anything in between, remains a poorly understood yet desirable ability for safe and robust agents. This paper proposes a definition of these environmental "attitudes" based on an characterization of the environment's ability to react to the agent's private strategy. We define… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

    Comments: 13 pages, 9 figures

  30. arXiv:1806.01186  [pdf, other

    cs.LG cs.AI stat.ML

    Penalizing side effects using stepwise relative reachability

    Authors: Victoria Krakovna, Laurent Orseau, Ramana Kumar, Miljan Martic, Shane Legg

    Abstract: How can we design safe reinforcement learning agents that avoid unnecessary disruptions to their environment? We show that current approaches to penalizing side effects can introduce bad incentives, e.g. to prevent any irreversible changes in the environment, including the actions of other agents. To isolate the source of such undesirable incentives, we break down side effects penalties into two c… ▽ More

    Submitted 8 March, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

  31. arXiv:1805.12387  [pdf, other

    cs.LG cs.AI stat.ML

    Agents and Devices: A Relative Definition of Agency

    Authors: Laurent Orseau, Simon McGregor McGill, Shane Legg

    Abstract: According to Dennett, the same system may be described using a `physical' (mechanical) explanatory stance, or using an `intentional' (belief- and goal-based) explanatory stance. Humans tend to find the physical stance more helpful for certain systems, such as planets orbiting a star, and the intentional stance for others, such as living animals. We define a formal counterpart of physical and inten… ▽ More

    Submitted 31 May, 2018; originally announced May 2018.

  32. arXiv:1802.01561  [pdf, other

    cs.LG cs.AI

    IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

    Authors: Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

    Abstract: In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distributed agent IMPALA (Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in single-machine training but also… ▽ More

    Submitted 28 June, 2018; v1 submitted 5 February, 2018; originally announced February 2018.

  33. arXiv:1801.08116  [pdf, other

    cs.AI cs.NE q-bio.NC

    Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

    Authors: Joel Z. Leibo, Cyprien de Masson d'Autume, Daniel Zoran, David Amos, Charles Beattie, Keith Anderson, Antonio García Castañeda, Manuel Sanchez, Simon Green, Audrunas Gruslys, Shane Legg, Demis Hassabis, Matthew M. Botvinick

    Abstract: Psychlab is a simulated psychology laboratory inside the first-person 3D game world of DeepMind Lab (Beattie et al. 2016). Psychlab enables implementations of classical laboratory psychological experiments so that they work with both human and artificial agents. Psychlab has a simple and flexible API that enables users to easily create their own tasks. As examples, we are releasing Psychlab implem… ▽ More

    Submitted 4 February, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

    Comments: 28 pages, 11 figures

  34. arXiv:1711.09883  [pdf, other

    cs.LG cs.AI

    AI Safety Gridworlds

    Authors: Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg

    Abstract: We present a suite of reinforcement learning environments illustrating various safety properties of intelligent agents. These problems include safe interruptibility, avoiding side effects, absent supervisor, reward gaming, safe exploration, as well as robustness to self-modification, distributional shift, and adversaries. To measure compliance with the intended safe behavior, we equip each environ… ▽ More

    Submitted 28 November, 2017; v1 submitted 27 November, 2017; originally announced November 2017.

  35. arXiv:1711.08378  [pdf

    cs.AI

    Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017

    Authors: M. Botvinick, D. G. T. Barrett, P. Battaglia, N. de Freitas, D. Kumaran, J. Z Leibo, T. Lillicrap, J. Modayil, S. Mohamed, N. C. Rabinowitz, D. J. Rezende, A. Santoro, T. Schaul, C. Summerfield, G. Wayne, T. Weber, D. Wierstra, S. Legg, D. Hassabis

    Abstract: We agree with Lake and colleagues on their list of key ingredients for building humanlike intelligence, including the idea that model-based reasoning is essential. However, we favor an approach that centers on one additional ingredient: autonomy. In particular, we aim toward agents that can both build and exploit their own internal models, with minimal human hand-engineering. We believe an approac… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

  36. arXiv:1711.05074  [pdf, other

    cs.GT cs.MA

    Symmetric Decomposition of Asymmetric Games

    Authors: Karl Tuyls, Julien Perolat, Marc Lanctot, Georg Ostrovski, Rahul Savani, Joel Leibo, Toby Ord, Thore Graepel, Shane Legg

    Abstract: We introduce new theoretical insights into two-population asymmetric games allowing for an elegant symmetric decomposition into two single population symmetric games. Specifically, we show how an asymmetric bimatrix game (A,B) can be decomposed into its symmetric counterparts by envisioning and investigating the payoff tables (A and B) that constitute the asymmetric game, as two independent, singl… ▽ More

    Submitted 17 January, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Paper is published in Scientific Reports; https://www.nature.com/articles/s41598-018-19194-4, 2018

  37. arXiv:1706.10295  [pdf, other

    cs.LG stat.ML

    Noisy Networks for Exploration

    Authors: Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

    Abstract: We introduce NoisyNet, a deep reinforcement learning agent with parametric noise added to its weights, and show that the induced stochasticity of the agent's policy can be used to aid efficient exploration. The parameters of the noise are learned with gradient descent along with the remaining network weights. NoisyNet is straightforward to implement and adds little computational overhead. We find… ▽ More

    Submitted 9 July, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: ICLR 2018

  38. arXiv:1706.03741  [pdf, other

    stat.ML cs.AI cs.HC cs.LG

    Deep reinforcement learning from human preferences

    Authors: Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei

    Abstract: For sophisticated reinforcement learning (RL) systems to interact usefully with real-world environments, we need to communicate complex goals to these systems. In this work, we explore goals defined in terms of (non-expert) human preferences between pairs of trajectory segments. We show that this approach can effectively solve complex RL tasks without access to the reward function, including Atari… ▽ More

    Submitted 17 February, 2023; v1 submitted 12 June, 2017; originally announced June 2017.

  39. arXiv:1705.08417  [pdf, other

    cs.AI cs.LG stat.ML

    Reinforcement Learning with a Corrupted Reward Channel

    Authors: Tom Everitt, Victoria Krakovna, Laurent Orseau, Marcus Hutter, Shane Legg

    Abstract: No real-world reward function is perfect. Sensory errors and software bugs may result in RL agents observing higher (or lower) rewards than they should. For example, a reinforcement learning agent may prefer states where a sensory error gives it the maximum reward, but where the true reward is actually small. We formalise this problem as a generalised Markov Decision Problem called Corrupt Reward… ▽ More

    Submitted 19 August, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: A shorter version of this report was accepted to IJCAI 2017 AI and Autonomy track

    ACM Class: I.2.6; I.2.8

  40. arXiv:1612.03801  [pdf, other

    cs.AI

    DeepMind Lab

    Authors: Charles Beattie, Joel Z. Leibo, Denis Teplyashin, Tom Ward, Marcus Wainwright, Heinrich Küttler, Andrew Lefrancq, Simon Green, Víctor Valdés, Amir Sadik, Julian Schrittwieser, Keith Anderson, Sarah York, Max Cant, Adam Cain, Adrian Bolton, Stephen Gaffney, Helen King, Demis Hassabis, Shane Legg, Stig Petersen

    Abstract: DeepMind Lab is a first-person 3D game platform designed for research and development of general artificial intelligence and machine learning systems. DeepMind Lab can be used to study how autonomous artificial agents may learn complex tasks in large, partially observed, and visually diverse worlds. DeepMind Lab has a simple and flexible API enabling creative task-designs and novel AI-designs to b… ▽ More

    Submitted 13 December, 2016; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: 11 pages, 8 figures

  41. arXiv:1507.04296  [pdf, other

    cs.LG cs.AI cs.DC cs.NE

    Massively Parallel Methods for Deep Reinforcement Learning

    Authors: Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, David Silver

    Abstract: We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the… ▽ More

    Submitted 16 July, 2015; v1 submitted 15 July, 2015; originally announced July 2015.

    Comments: Presented at the Deep Learning Workshop, International Conference on Machine Learning, Lille, France, 2015

  42. arXiv:1109.5951  [pdf, ps, other

    cs.AI

    An Approximation of the Universal Intelligence Measure

    Authors: Shane Legg, Joel Veness

    Abstract: The Universal Intelligence Measure is a recently proposed formal definition of intelligence. It is mathematically specified, extremely general, and captures the essence of many informal definitions of intelligence. It is based on Hutter's Universal Artificial Intelligence theory, an extension of Ray Solomonoff's pioneering work on universal induction. Since the Universal Intelligence Measure is on… ▽ More

    Submitted 29 September, 2011; v1 submitted 27 September, 2011; originally announced September 2011.

    Comments: 14 pages

  43. arXiv:0810.5631  [pdf, ps, other

    cs.LG cs.AI

    Temporal Difference Updating without a Learning Rate

    Authors: Marcus Hutter, Shane Legg

    Abstract: We derive an equation for temporal difference learning from statistical principles. Specifically, we start with the variational principle and then bootstrap to produce an updating rule for discounted state value estimates. The resulting equation is similar to the standard equation for temporal difference learning with eligibility traces, so called TD(lambda), however it lacks the parameter alpha… ▽ More

    Submitted 31 October, 2008; originally announced October 2008.

    Comments: 12 pages, 6 figures

    Journal ref: Advances in Neural Information Processing Systems 20 (NIPS 2008) pages 705-712

  44. arXiv:0712.3825  [pdf, ps, other

    cs.AI

    Tests of Machine Intelligence

    Authors: Shane Legg, Marcus Hutter

    Abstract: Although the definition and measurement of intelligence is clearly of fundamental importance to the field of artificial intelligence, no general survey of definitions and tests of machine intelligence exists. Indeed few researchers are even aware of alternatives to the Turing test and its many derivatives. In this paper we fill this gap by providing a short survey of the many tests of machine in… ▽ More

    Submitted 21 December, 2007; originally announced December 2007.

    Comments: 12 pages; 1 table. Turing test and derivatives; Compression tests; Linguistic complexity; Multiple cognitive abilities; Competitive games; Psychometric tests; Smith's test; C-test; Universal intelligence

    Report number: IDSIA-11-07

    Journal ref: 50 Years of Artificial Intelligence (2007) pages 232-242

  45. arXiv:0712.3329  [pdf, ps, other

    cs.AI

    Universal Intelligence: A Definition of Machine Intelligence

    Authors: Shane Legg, Marcus Hutter

    Abstract: A fundamental problem in artificial intelligence is that nobody really knows what intelligence is. The problem is especially acute when we need to consider artificial systems which are significantly different to humans. In this paper we approach this problem in the following way: We take a number of well known informal definitions of human intelligence that have been given by experts, and extrac… ▽ More

    Submitted 20 December, 2007; originally announced December 2007.

    Comments: 50 gentle pages

    Report number: IDSIA-10-07

    Journal ref: Minds & Machines, 17:4 (2007) pages 391-444

  46. arXiv:0706.3639  [pdf, ps, other

    cs.AI

    A Collection of Definitions of Intelligence

    Authors: Shane Legg, Marcus Hutter

    Abstract: This paper is a survey of a large number of informal definitions of ``intelligence'' that the authors have collected over the years. Naturally, compiling a complete list would be impossible as many definitions of intelligence are buried deep inside articles and books. Nevertheless, the 70-odd definitions presented here are, to the authors' knowledge, the largest and most well referenced collecti… ▽ More

    Submitted 25 June, 2007; originally announced June 2007.

    Comments: 12 LaTeX pages

    Report number: IDSIA-07-07

    Journal ref: Frontiers in Artificial Intelligence and Applications, Vol.157 (2007) 17-24

  47. Fitness Uniform Optimization

    Authors: Marcus Hutter, Shane Legg

    Abstract: In evolutionary algorithms, the fitness of a population increases with time by mutating and recombining individuals and by a biased selection of more fit individuals. The right selection pressure is critical in ensuring sufficient optimization progress on the one hand and in preserving genetic diversity to be able to escape from local optima on the other hand. Motivated by a universal similarity… ▽ More

    Submitted 20 October, 2006; originally announced October 2006.

    Comments: 25 double-column pages, 12 figures

    Report number: IDSIA-16-06

    Journal ref: IEEE Transactions on Evolutionary Computation, 10:5 (2006) 568-589

  48. arXiv:cs/0606070  [pdf, ps, other

    cs.AI cs.CC

    Is there an Elegant Universal Theory of Prediction?

    Authors: Shane Legg

    Abstract: Solomonoff's inductive learning model is a powerful, universal and highly elegant theory of sequence prediction. Its critical flaw is that it is incomputable and thus cannot be used in practice. It is sometimes suggested that it may still be useful to help guide the development of very general and powerful theories of prediction which are computable. In this paper it is shown that although power… ▽ More

    Submitted 14 June, 2006; originally announced June 2006.

    Report number: IDSIA - 12 - 06

  49. arXiv:cs/0605024  [pdf, ps, other

    cs.AI cs.LG

    A Formal Measure of Machine Intelligence

    Authors: Shane Legg, Marcus Hutter

    Abstract: A fundamental problem in artificial intelligence is that nobody really knows what intelligence is. The problem is especially acute when we need to consider artificial systems which are significantly different to humans. In this paper we approach this problem in the following way: We take a number of well known informal definitions of human intelligence that have been given by experts, and extrac… ▽ More

    Submitted 6 May, 2006; originally announced May 2006.

    Comments: 8 two-column pages

    Report number: IDSIA-10-06

    Journal ref: Proc. 15th Annual Machine Learning Conference of {B}elgium and The Netherlands (Benelearn 2006) pages 73-80

  50. arXiv:cs/0504035  [pdf, ps, other

    cs.NE cs.AI

    Fitness Uniform Deletion: A Simple Way to Preserve Diversity

    Authors: Shane Legg, Marcus Hutter

    Abstract: A commonly experienced problem with population based optimisation methods is the gradual decline in population diversity that tends to occur over time. This can slow a system's progress or even halt it completely if the population converges on a local optimum from which it cannot escape. In this paper we present the Fitness Uniform Deletion Scheme (FUDS), a simple but somewhat unconventional app… ▽ More

    Submitted 11 April, 2005; originally announced April 2005.

    Comments: 8 two-column pages, 19 figures

    Report number: IDSIA-11-04 ACM Class: I.2.M

    Journal ref: Proc. Genetic and Evolutionary Computation Conference (GECCO 2005) 1271-1278