Skip to main content

Showing 1–18 of 18 results for author: Dayan, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01870  [pdf, other

    cs.MA cs.GT

    Detecting and Deterring Manipulation in a Cognitive Hierarchy

    Authors: Nitay Alon, Lion Schulz, Joseph M. Barnby, Jeffrey S. Rosenschein, Peter Dayan

    Abstract: Social agents with finitely nested opponent models are vulnerable to manipulation by agents with deeper reasoning and more sophisticated opponent modelling. This imbalance, rooted in logic and the theory of recursive modelling frameworks, cannot be solved directly. We propose a computational framework, $\aleph$-IPOMDP, augmenting model-based RL agents' Bayesian inference with an anomaly detection… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  2. arXiv:2405.00899  [pdf, other

    cs.HC cs.AI cs.CL q-bio.NC

    Characterising the Creative Process in Humans and Large Language Models

    Authors: Surabhi S. Nath, Peter Dayan, Claire Stevenson

    Abstract: Large language models appear quite creative, often performing on par with the average human on creative tasks. However, research on LLM creativity has focused solely on \textit{products}, with little attention on the creative \textit{process}. Process analyses of human creativity often require hand-coded categories or exploit response times, which do not apply to LLMs. We provide an automated meth… ▽ More

    Submitted 5 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  3. arXiv:2403.03134  [pdf, other

    cs.CV cs.AI q-bio.NC

    Simplicity in Complexity : Explaining Visual Complexity using Deep Segmentation Models

    Authors: Tingke Shen, Surabhi S Nath, Aenne Brielmann, Peter Dayan

    Abstract: The complexity of visual stimuli plays an important role in many cognitive phenomena, including attention, engagement, memorability, time perception and aesthetic evaluation. Despite its importance, complexity is poorly understood and ironically, previous models of image complexity have been quite complex. There have been many attempts to find handcrafted features that explain complexity, but thes… ▽ More

    Submitted 6 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2401.17835  [pdf, other

    cs.LG

    Predicting the Future with Simple World Models

    Authors: Tankred Saanum, Peter Dayan, Eric Schulz

    Abstract: World models can represent potentially high-dimensional pixel observations in compact latent spaces, making it tractable to model the dynamics of the environment. However, the latent dynamics inferred by these models may still be highly complex. Abstracting the dynamics of the environment with simple models can have several benefits. If the latent dynamics are simple, the model may generalize bett… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  5. arXiv:2307.01784  [pdf, other

    cs.CL cs.AI

    The Inner Sentiments of a Thought

    Authors: Chris Gagne, Peter Dayan

    Abstract: Transformer-based large-scale language models (LLMs) are able to generate highly realistic text. They are duly able to express, and at least implicitly represent, a wide range of sentiments and color, from the obvious, such as valence and arousal to the subtle, such as determination and admiration. We provide a first exploration of these representations and how they can be used for understanding t… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  6. arXiv:2306.05298  [pdf, other

    cs.AI

    Habits of Mind: Reusing Action Sequences for Efficient Planning

    Authors: Noémi Éltető, Peter Dayan

    Abstract: When we exercise sequences of actions, their execution becomes more fluent and precise. Here, we consider the possibility that exercised action sequences can also be used to make planning faster and more accurate by focusing expansion of the search tree on paths that have been frequently used in the past, and by reducing deep planning problems to shallow ones via multi-step jumps in the tree. To c… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  7. arXiv:2305.17109  [pdf, other

    cs.LG

    Reinforcement Learning with Simple Sequence Priors

    Authors: Tankred Saanum, Noémi Éltető, Peter Dayan, Marcel Binz, Eric Schulz

    Abstract: Everything else being equal, simpler models should be preferred over more complex ones. In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis -- but this timescale ignores temporal regularities, like repetitions, often present in sequential strategies. We therefore propose an RL algorithm that learns to solve tasks with sequences of actions that are compre… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2111.06804  [pdf, other

    cs.AI

    Catastrophe, Compounding & Consistency in Choice

    Authors: Chris Gagne, Peter Dayan

    Abstract: Conditional value-at-risk (CVaR) precisely characterizes the influence that rare, catastrophic events can exert over decisions. Such characterizations are important for both normal decision-making and for psychiatric conditions such as anxiety disorders -- especially for sequences of decisions that might ultimately lead to disaster. CVaR, like other well-founded risk measures, compounds in complex… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  9. arXiv:2111.06803  [pdf, other

    cs.AI

    Two steps to risk sensitivity

    Authors: Chris Gagne, Peter Dayan

    Abstract: Distributional reinforcement learning (RL) -- in which agents learn about all the possible long-term consequences of their actions, and not just the expected value -- is of great recent interest. One of the most important affordances of a distributional view is facilitating a modern, measured, approach to risk when outcomes are not completely certain. By contrast, psychological and neuroscientific… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  10. arXiv:2010.01192  [pdf, other

    cs.LG cs.AI cs.MA

    Correcting Experience Replay for Multi-Agent Communication

    Authors: Sanjeevan Ahilan, Peter Dayan

    Abstract: We consider the problem of learning to communicate using multi-agent reinforcement learning (MARL). A common approach is to learn off-policy, using data sampled from a replay buffer. However, messages received in the past may not accurately reflect the current communication policy of each agent, and this complicates learning. We therefore introduce a 'communication correction' which accounts for t… ▽ More

    Submitted 28 February, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

  11. arXiv:2002.04335  [pdf, other

    cs.AI cs.LG

    Static and Dynamic Values of Computation in MCTS

    Authors: Eren Sezener, Peter Dayan

    Abstract: Monte-Carlo Tree Search (MCTS) is one of the most-widely used methods for planning, and has powered many recent advances in artificial intelligence. In MCTS, one typically performs computations (i.e., simulations) to collect statistics about the possible future consequences of actions, and then chooses accordingly. Many popular MCTS methods such as UCT and its variants decide which computations to… ▽ More

    Submitted 19 November, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: Presented in UAI 2020

    Journal ref: PMLR 124:31-40, 2020

  12. arXiv:1901.08492  [pdf, other

    cs.MA cs.AI cs.LG

    Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning

    Authors: Sanjeevan Ahilan, Peter Dayan

    Abstract: We investigate how reinforcement learning agents can learn to cooperate. Drawing inspiration from human societies, in which successful coordination of many individuals is often facilitated by hierarchical organisation, we introduce Feudal Multi-agent Hierarchies (FMH). In this framework, a 'manager' agent, which is tasked with maximising the environmentally-determined reward function, learns to co… ▽ More

    Submitted 24 January, 2019; originally announced January 2019.

  13. arXiv:1810.00555  [pdf, other

    stat.ML cs.AI cs.LG

    Probabilistic Meta-Representations Of Neural Networks

    Authors: Theofanis Karaletsos, Peter Dayan, Zoubin Ghahramani

    Abstract: Existing Bayesian treatments of neural networks are typically characterized by weak prior and approximate posterior distributions according to which all the weights are drawn independently. Here, we consider a richer prior distribution in which units in the network are represented by latent variables, and the weights between units are drawn conditionally on the values of the collection of those va… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: presented at UAI 2018 Uncertainty In Deep Learning Workshop (UDL AUG. 2018)

  14. arXiv:1803.10049  [pdf, other

    cs.LG stat.ML

    Fast Parametric Learning with Activation Memorization

    Authors: Jack W Rae, Chris Dyer, Peter Dayan, Timothy P Lillicrap

    Abstract: Neural networks trained with backpropagation often struggle to identify classes that have been observed a small number of times. In applications where most class labels are rare, such as language modelling, this can become a performance bottleneck. One potential remedy is to augment the network with a fast-learning non-parametric model which stores recent activations and class labels into an exter… ▽ More

    Submitted 27 March, 2018; originally announced March 2018.

  15. arXiv:1705.05263  [pdf, other

    cs.LG

    Comparison of Maximum Likelihood and GAN-based training of Real NVPs

    Authors: Ivo Danihelka, Balaji Lakshminarayanan, Benigno Uria, Daan Wierstra, Peter Dayan

    Abstract: We train a generator by maximum likelihood and we also train the same generator architecture by Wasserstein GAN. We then compare the generated samples, exact log-probability densities and approximate Wasserstein distances. We show that an independent critic trained to approximate Wasserstein distance between the validation set and the generator distribution helps detect overfitting. Finally, we us… ▽ More

    Submitted 15 May, 2017; originally announced May 2017.

  16. Formalizing Neurath's Ship: Approximate Algorithms for Online Causal Learning

    Authors: Neil R. Bramley, Peter Dayan, Thomas L. Griffiths, David A. Lagnado

    Abstract: Higher-level cognition depends on the ability to learn models of the world. We can characterize this at the computational level as a structure-learning problem with the goal of best identifying the prevailing causal relationships among a set of relata. However, the computational cost of performing exact Bayesian inference over causal models grows rapidly as the number of relata increases. This imp… ▽ More

    Submitted 26 May, 2017; v1 submitted 14 September, 2016; originally announced September 2016.

    Journal ref: Psychological Review, Vol 124(3), Apr 2017, 301-338

  17. arXiv:1402.1958  [pdf, other

    cs.AI cs.LG stat.ML

    Better Optimism By Bayes: Adaptive Planning with Rich Models

    Authors: Arthur Guez, David Silver, Peter Dayan

    Abstract: The computational costs of inference and planning have confined Bayesian model-based reinforcement learning to one of two dismal fates: powerful Bayes-adaptive planning but only for simplistic models, or powerful, Bayesian non-parametric models but using simple, myopic planning strategies such as Thompson sampling. We ask whether it is feasible and truly beneficial to combine rich probabilistic mo… ▽ More

    Submitted 9 February, 2014; originally announced February 2014.

    Comments: 11 pages, 11 figures

  18. arXiv:1205.3109  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

    Authors: Arthur Guez, David Silver, Peter Dayan

    Abstract: Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal policies is notoriously taxing, since the search space becomes enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optima… ▽ More

    Submitted 18 December, 2013; v1 submitted 14 May, 2012; originally announced May 2012.

    Comments: 14 pages, 7 figures, includes supplementary material. Advances in Neural Information Processing Systems (NIPS) 2012

    Journal ref: (2012) Advances in Neural Information Processing Systems 25, pages 1034-1042