Skip to main content

Showing 1–7 of 7 results for author: Tieleman, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.05314  [pdf, other

    cs.LG cs.AI

    Large-Scale Retrieval for Reinforcement Learning

    Authors: Peter C. Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy Lillicrap

    Abstract: Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm is for an agent to amortise information that helps decision making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale… ▽ More

    Submitted 16 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022), 16 pages

  2. arXiv:2101.05125  [pdf, other

    cs.AI

    Formalising Concepts as Grounded Abstractions

    Authors: Stephen Clark, Alexander Lerchner, Tamara von Glehn, Olivier Tieleman, Richard Tanburn, Misha Dashevskiy, Matko Bosnjak

    Abstract: The notion of concept has been studied for centuries, by philosophers, linguists, cognitive scientists, and researchers in artificial intelligence (Margolis & Laurence, 1999). There is a large literature on formal, mathematical models of concepts, including a whole sub-field of AI -- Formal Concept Analysis -- devoted to this topic (Ganter & Obiedkov, 2016). Recently, researchers in machine learni… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  3. arXiv:2009.01719  [pdf, other

    cs.CL cs.AI

    Grounded Language Learning Fast and Slow

    Authors: Felix Hill, Olivier Tieleman, Tamara von Glehn, Nathaniel Wong, Hamza Merzic, Stephen Clark

    Abstract: Recent work has shown that large text-based neural language models, trained with conventional supervised learning objectives, acquire a surprising propensity for few- and one-shot learning. Here, we show that an embodied agent situated in a simulated 3D world, and endowed with a novel dual-coding external memory, can exhibit similar one-shot word learning when trained with conventional reinforceme… ▽ More

    Submitted 14 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

  4. arXiv:2005.07064  [pdf, other

    cs.CL cs.AI cs.LG

    Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

    Authors: Angeliki Lazaridou, Anna Potapenko, Olivier Tieleman

    Abstract: We present a method for combining multi-agent communication and traditional data-driven approaches to natural language learning, with an end goal of teaching agents to communicate with humans in natural language. Our starting point is a language model that has been trained on generic, not task-specific language data. We then place this model in a multi-agent self-play environment that generates ta… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: to appear at ACL 2020

  5. arXiv:2002.06038  [pdf, other

    cs.LG stat.ML

    Never Give Up: Learning Directed Exploration Strategies

    Authors: Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andew Bolt, Charles Blundell

    Abstract: We propose a reinforcement learning agent to solve hard exploration games by learning a range of directed exploratory policies. We construct an episodic memory-based intrinsic reward using k-nearest neighbors over the agent's recent experience to train the directed exploratory policies, thereby encouraging the agent to repeatedly revisit all states in its environment. A self-supervised inverse dyn… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper in ICLR 2020

  6. arXiv:1912.06208  [pdf, other

    cs.CL cs.NE

    Sha** representations through communication: community size effect in artificial learning systems

    Authors: Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell, Doina Precup

    Abstract: Motivated by theories of language and communication that explain why communities with large numbers of speakers have, on average, simpler languages with more regularity, we cast the representation learning problem in terms of learning to communicate. Our starting point sees the traditional autoencoder setup as a single encoder with a fixed decoder partner that must learn to communicate. Generalizi… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: NeurIPS 2019 workshop on visually grounded interaction and language

  7. arXiv:1812.07480  [pdf, other

    stat.ML cs.AI cs.LG

    A Factorial Mixture Prior for Compositional Deep Generative Models

    Authors: Ulrich Paquet, Sumedh K. Ghaisas, Olivier Tieleman

    Abstract: We assume that a high-dimensional datum, like an image, is a compositional expression of a set of properties, with a complicated non-linear relationship between the datum and its properties. This paper proposes a factorial mixture prior for capturing latent properties, thereby adding structured compositionality to deep generative models. The prior treats a latent vector as belonging to Cartesian p… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 16 pagers, 10 figures