Skip to main content

Showing 1–19 of 19 results for author: Raposo, D

.
  1. arXiv:2404.02258  [pdf, other

    cs.LG cs.CL

    Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Authors: David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Conway Humphreys, Adam Santoro

    Abstract: Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate FLOPs (or compute) to specific positions in a sequence, optimising the allocation along the sequence for different layers across the model depth. Our method enforces a total compute budget by cap** the number of tokens ($k$) that… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2312.03635  [pdf, other

    cs.NI

    Towards Time Sensitive Networking on Smart Cities: Techniques, Challenges, and Solutions

    Authors: Rui Lopes, Duarte Raposo, Susana Sargento

    Abstract: The rapid proliferation of smart cities has transformed urban landscapes into dynamic ecosystems teeming with interconnected computational nodes and sensors. During this evolution, the search for seamless communication in time-critical scenarios has become evident. With the escalating complexity of urban environments, envisioning a future with a blend of autonomous and conventional systems, each d… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    ACM Class: C.2.1

  3. arXiv:2207.12200  [pdf, other

    cs.NI

    Aveiro Tech City Living Lab: A Communication, Sensing and Computing Platform for City Environments

    Authors: Pedro Rito, Ana Almeida, Andreia Figueiredo, Christian Gomes, Pedro Teixeira, Rodrigo Rosmaninho, Rui Lopes, Duarte Dias, Gonçalo Vítor, Gonçalo Perna, Miguel Silva, Carlos Senna, Duarte Raposo, Miguel Luís, Susana Sargento, Arnaldo Oliveira, Nuno Borges de Carvalho

    Abstract: This article presents the deployment and experimentation architecture of the Aveiro Tech City Living Lab (ATCLL) in Aveiro, Portugal. This platform comprises a large number of Internet-of-Things devices with communication, sensing and computing capabilities. The communication infrastructure, built on fiber and Millimeter-wave (mmWave) links, integrates a communication network with radio terminals… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    ACM Class: C.2.1

  4. arXiv:2206.05396  [pdf, other

    math.GM

    A systematic approach on some relevant theorems that follows from Kolmogorov's axioms

    Authors: Diego J. Raposo

    Abstract: A selection of the relevant theorems of Probability Theory that comes directly from Kolmogorov's axioms, Set Theory basic results, definitions and rules of inference are listed and proven in a systematic approach, aiming the student who seeks a self-contained account on the matter before moving to more advanced material.

    Submitted 10 June, 2022; originally announced June 2022.

    MSC Class: 60-01; 60A05

  5. arXiv:2205.00793  [pdf, other

    cs.IT cs.NI

    Ultra-Reliable Low-Latency Millimeter-Wave Communications with Sliding Window Network Coding

    Authors: Eurico Dias, Duarte Raposo, Homa Esfahanizadeh, Alejandro Cohen, Tânia Ferreira, Miguel Luís, Susana Sargento, Muriel Médard

    Abstract: Ultra-reliability and low-latency are pivotal requirements of the new 6th generation of communication systems (xURLLC). Over the past years, to increase throughput, adaptive active antennas were introduced in advanced wireless communications, specifically in the domain of millimeter-wave (mmWave). Consequently, new lower-layer techniques were proposed to cope with practical challenges of high dime… ▽ More

    Submitted 15 September, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

  6. arXiv:2202.08137  [pdf, other

    cs.LG

    A data-driven approach for learning to control computers

    Authors: Peter C Humphreys, David Raposo, Toby Pohlen, Gregory Thornton, Rachita Chhaparia, Alistair Muldal, Josh Abramson, Petko Georgiev, Alex Goldin, Adam Santoro, Timothy Lillicrap

    Abstract: It would be useful for machines to use computers as humans do so that they can aid us in everyday tasks. This is a setting in which there is also the potential to leverage large-scale expert demonstrations and human judgements of interactive behaviour, which are two ingredients that have driven much recent success in AI. Here we investigate the setting of computer control using keyboard and mouse,… ▽ More

    Submitted 11 November, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  7. arXiv:2102.12425  [pdf, other

    cs.LG

    Synthetic Returns for Long-Term Credit Assignment

    Authors: David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

    Abstract: Since the earliest days of reinforcement learning, the workhorse method for assigning credit to actions over time has been temporal-difference (TD) learning, which propagates credit backward timestep-by-timestep. This approach suffers when delays between actions and rewards are long and when intervening unrelated events contribute variance to long-term returns. We propose state-associative (SA) le… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  8. arXiv:2102.03406  [pdf, other

    cs.AI cs.LG

    Symbolic Behaviour in Artificial Intelligence

    Authors: Adam Santoro, Andrew Lampinen, Kory Mathewson, Timothy Lillicrap, David Raposo

    Abstract: The ability to use symbols is the pinnacle of human intelligence, but has yet to be fully replicated in machines. Here we argue that the path towards symbolically fluent artificial intelligence (AI) begins with a reinterpretation of what symbols are, how they come to exist, and how a system behaves when it uses them. We begin by offering an interpretation of symbols as entities whose meaning is es… ▽ More

    Submitted 21 January, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

  9. arXiv:2010.00343  [pdf, other

    cs.NI cs.IT

    Bringing Network Coding into SDN: A Case-study for Highly Meshed Heterogeneous Communications

    Authors: Alejandro Cohen, Homa Esfahanizadeh, Bruno Sousa, João P. Vilela, Miguel Luís, Duarte Raposo, Francois Michel, Susana Sargento, Muriel Médard

    Abstract: Modern communications have moved away from point-to-point models to increasingly heterogeneous network models. In this article, we propose a novel controller-based protocol to deploy adaptive causal network coding in heterogeneous and highly-meshed communication networks. Specifically, we consider using Software-Defined-Network (SDN) as the main controller. We first present an architecture for the… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

  10. arXiv:2006.03662  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Rapid Task-Solving in Novel Environments

    Authors: Sam Ritter, Ryan Faulkner, Laurent Sartran, Adam Santoro, Matt Botvinick, David Raposo

    Abstract: We propose the challenge of rapid task-solving in novel environments (RTS), wherein an agent must solve a series of tasks as rapidly as possible in an unfamiliar environment. An effective RTS agent must balance between exploring the unfamiliar environment and solving its current task, all while building a model of the new environment over which it can plan when faced with later tasks. While modern… ▽ More

    Submitted 19 April, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

  11. arXiv:1904.10396  [pdf, other

    q-bio.NC cs.AI cs.LG

    Is coding a relevant metaphor for building AI? A commentary on "Is coding a relevant metaphor for the brain?", by Romain Brette

    Authors: Adam Santoro, Felix Hill, David Barrett, David Raposo, Matthew Botvinick, Timothy Lillicrap

    Abstract: Brette contends that the neural coding metaphor is an invalid basis for theories of what the brain does. Here, we argue that it is an insufficient guide for building an artificial intelligence that learns to accomplish short- and long-term goals in a complex, changing environment.

    Submitted 18 April, 2019; originally announced April 2019.

  12. arXiv:1901.08162  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Reasoning from Meta-reinforcement Learning

    Authors: Ishita Dasgupta, Jane Wang, Silvia Chiappa, Jovana Mitrovic, Pedro Ortega, David Raposo, Edward Hughes, Peter Battaglia, Matthew Botvinick, Zeb Kurth-Nelson

    Abstract: Discovering and exploiting the causal structure in the environment is a crucial challenge for intelligent agents. Here we explore whether causal reasoning can emerge via meta-reinforcement learning. We train a recurrent network with model-free reinforcement learning to solve a range of problems that each contain causal structure. We find that the trained agent can perform causal reasoning in novel… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

  13. arXiv:1901.03559  [pdf, other

    cs.LG cs.AI stat.ML

    An investigation of model-free planning

    Authors: Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap

    Abstract: The field of reinforcement learning (RL) is facing increasingly challenging domains with combinatorial complexity. For an RL agent to address these challenges, it is essential that it can plan effectively. Prior work has typically utilized an explicit model of the environment, combined with a specific planning algorithm (such as tree search). More recently, a new family of methods have been propos… ▽ More

    Submitted 20 May, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

  14. arXiv:1806.01830  [pdf, other

    cs.LG stat.ML

    Relational Deep Reinforcement Learning

    Authors: Vinicius Zambaldi, David Raposo, Adam Santoro, Victor Bapst, Yujia Li, Igor Babuschkin, Karl Tuyls, David Reichert, Timothy Lillicrap, Edward Lockhart, Murray Shanahan, Victoria Langston, Razvan Pascanu, Matthew Botvinick, Oriol Vinyals, Peter Battaglia

    Abstract: We introduce an approach for deep reinforcement learning (RL) that improves upon the efficiency, generalization capacity, and interpretability of conventional approaches through structured perception and relational reasoning. It uses self-attention to iteratively reason about the relations between entities in a scene and to guide a model-free policy. Our results show that in a novel navigation and… ▽ More

    Submitted 28 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  15. arXiv:1806.01822  [pdf, other

    cs.LG stat.ML

    Relational recurrent neural networks

    Authors: Adam Santoro, Ryan Faulkner, David Raposo, Jack Rae, Mike Chrzanowski, Theophane Weber, Daan Wierstra, Oriol Vinyals, Razvan Pascanu, Timothy Lillicrap

    Abstract: Memory-based neural networks model temporal data by leveraging an ability to remember information for long periods. It is unclear, however, whether they also have an ability to perform complex relational reasoning with the information they remember. Here, we first confirm our intuitions that standard memory architectures may struggle at tasks that heavily involve an understanding of the ways in wh… ▽ More

    Submitted 28 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  16. arXiv:1806.01261  [pdf, other

    cs.LG cs.AI stat.ML

    Relational inductive biases, deep learning, and graph networks

    Authors: Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals , et al. (2 additional authors not shown)

    Abstract: Artificial intelligence (AI) has undergone a renaissance recently, making major progress in key domains such as vision, language, control, and decision-making. This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. However, many defining characteristics of human intelligence, which developed under much different pressures, rema… ▽ More

    Submitted 17 October, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

  17. arXiv:1805.09786  [pdf, other

    cs.NE

    Hyperbolic Attention Networks

    Authors: Caglar Gulcehre, Misha Denil, Mateusz Malinowski, Ali Razavi, Razvan Pascanu, Karl Moritz Hermann, Peter Battaglia, Victor Bapst, David Raposo, Adam Santoro, Nando de Freitas

    Abstract: We introduce hyperbolic attention networks to endow neural networks with enough capacity to match the complexity of data with hierarchical and power-law structure. A few recent approaches have successfully demonstrated the benefits of imposing hyperbolic geometry on the parameters of shallow networks. We extend this line of work by imposing hyperbolic geometry on the activations of neural networks… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

  18. arXiv:1706.01427  [pdf, other

    cs.CL cs.LG

    A simple neural network module for relational reasoning

    Authors: Adam Santoro, David Raposo, David G. T. Barrett, Mateusz Malinowski, Razvan Pascanu, Peter Battaglia, Timothy Lillicrap

    Abstract: Relational reasoning is a central component of generally intelligent behavior, but has proven difficult for neural networks to learn. In this paper we describe how to use Relation Networks (RNs) as a simple plug-and-play module to solve problems that fundamentally hinge on relational reasoning. We tested RN-augmented networks on three tasks: visual question answering using a challenging dataset ca… ▽ More

    Submitted 5 June, 2017; originally announced June 2017.

  19. arXiv:1702.05068  [pdf, other

    cs.LG cs.CV

    Discovering objects and their relations from entangled scene representations

    Authors: David Raposo, Adam Santoro, David Barrett, Razvan Pascanu, Timothy Lillicrap, Peter Battaglia

    Abstract: Our world can be succinctly and compactly described as structured scenes of objects and relations. A typical room, for example, contains salient objects such as tables, chairs and books, and these objects typically relate to each other by their underlying causes and semantics. This gives rise to correlated features, such as position, function and shape. Humans exploit knowledge of objects and thei… ▽ More

    Submitted 16 February, 2017; originally announced February 2017.

    Comments: ICLR Workshop 2017