
Showing 1–48 of 48 results for author: Hadsell, R

  1. arXiv:2404.07839  [pdf, other]

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, an open language model which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide a pre-trained model with 2B non-embedding parameters, and an instruction tuned var…

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2308.15975  [pdf, other]

    cs.RO cs.AI cs.CV

    RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

    Authors: Mel Vecerik, Carl Doersch, Yi Yang, Todor Davchev, Yusuf Aytar, Guangyao Zhou, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: For robots to be useful outside labs and specialized factories we need a way to teach them new useful behaviors quickly. Current approaches lack either the generality to onboard new tasks without task-specific engineering, or else lack the data-efficiency to do so in an amount of time that enables practical use. In this work we explore dense tracking as a representational vehicle to allow faster a…

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Project website: https://robotap.github.io

  3. arXiv:2306.11706  [pdf, other]

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de…

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  4. arXiv:2305.14654  [pdf, other]

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  5. Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

    Authors: Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, et al. (3 additional authors not shown)

    Abstract: We investigate whether Deep Reinforcement Learning (Deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies in dynamic environments. We used Deep RL to train a humanoid robot with 20 actuated joints to play a simplified one-versus-one (1v1) soccer game. The resulting agent exhibits robust…

    Submitted 11 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Project website: https://sites.google.com/view/op3-soccer

  6. arXiv:2210.13982  [pdf, other]

    cs.LG cs.CR

    Hindering Adversarial Attacks with Implicit Neural Representations

    Authors: Andrei A. Rusu, Dan A. Calian, Sven Gowal, Raia Hadsell

    Abstract: We introduce the Lossy Implicit Network Activation Coding (LINAC) defence, an input transformation which successfully hinders several common adversarial attacks on CIFAR-$10$ classifiers for perturbations up to $ε= 8/255$ in $L_\infty$ norm and $ε= 0.5$ in $L_2$ norm. Implicit neural representations are used to approximately encode pixel colour intensities in $2\text{D}$ images such that classifie…

    Submitted 22 October, 2022; originally announced October 2022.

    Journal ref: PMLR 162 (2022) 18910-18934

  7. arXiv:2210.12448  [pdf, other]

    cs.LG

    Probing Transfer in Deep Reinforcement Learning without Task Engineering

    Authors: Andrei A. Rusu, Sebastian Flennerhag, Dushyant Rao, Razvan Pascanu, Raia Hadsell

    Abstract: We evaluate the use of original game curricula supported by the Atari 2600 console as a heterogeneous transfer benchmark for deep reinforcement learning agents. Game designers created curricula using combinations of several discrete modifications to the basic versions of games such as Space Invaders, Breakout and Freeway, making them progressively more challenging for human players. By formally or…

    Submitted 22 October, 2022; originally announced October 2022.

  8. arXiv:2209.01947  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    MO2: Model-Based Offline Options

    Authors: Sasha Salter, Markus Wulfmeier, Dhruva Tirumala, Nicolas Heess, Martin Riedmiller, Raia Hadsell, Dushyant Rao

    Abstract: The ability to discover useful behaviours from past experience and transfer them to new tasks is considered a core component of natural embodied intelligence. Inspired by neuroscience, discovering behaviours that switch at bottleneck states have been long sought after for inducing plans of minimum description length across tasks. Prior approaches have either only supported online, on-policy, bottl…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs) Conference Track, 2022

  9. arXiv:2205.15659  [pdf, other]

    cs.LG cs.DS stat.ML

    The CLRS Algorithmic Reasoning Benchmark

    Authors: Petar Veličković, Adrià Puigdomènech Badia, David Budden, Razvan Pascanu, Andrea Banino, Misha Dashevskiy, Raia Hadsell, Charles Blundell

    Abstract: Learning representations of algorithms is an emerging area of machine learning, seeking to bridge concepts from neural networks with classical algorithms. Several important works have investigated whether neural networks can effectively reason like algorithms, typically by learning to execute them. The common trend in the area, however, is to generate targeted kinds of algorithmic data to evaluate…

    Submitted 4 June, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: To appear in ICML 2022. 19 pages, 4 figures

  10. arXiv:2205.06175  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    A Generalist Agent

    Authors: Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

    Abstract: Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, dec…

    Submitted 11 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Published at TMLR, 42 pages

    Journal ref: Transactions on Machine Learning Research, 11/2022, https://openreview.net/forum?id=1ikK0kHjvj

  11. arXiv:2203.17138  [pdf, other]

    cs.RO cs.AI cs.LG

    Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors

    Authors: Steven Bohez, Saran Tunyasuvunakool, Philemon Brakel, Fereshteh Sadeghi, Leonard Hasenclever, Yuval Tassa, Emilio Parisotto, Jan Humplik, Tuomas Haarnoja, Roland Hafner, Markus Wulfmeier, Michael Neunert, Ben Moran, Noah Siegel, Andrea Huber, Francesco Romano, Nathan Batchelor, Federico Casarini, Josh Merel, Raia Hadsell, Nicolas Heess

    Abstract: We investigate the use of prior knowledge of human and animal movement to learn reusable locomotion skills for real legged robots. Our approach builds upon previous work on imitating human or dog Motion Capture (MoCap) data to learn a movement skill module. Once learned, this skill module can be reused for complex downstream tasks. Importantly, due to the prior imposed by the MoCap data, our appro…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: 30 pages, 9 figures, 8 tables, 14 videos at https://bit.ly/robot-npmp , submitted to Science Robotics

  12. arXiv:2112.05062  [pdf, other]

    cs.LG cs.AI cs.RO

    Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

    Authors: Dushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell

    Abstract: For robots operating in the real world, it is desirable to learn reusable behaviours that can effectively be transferred and adapted to numerous tasks and scenarios. We propose an approach to learn abstract motor skills from data using a hierarchical mixture latent variable model. In contrast to existing work, our method exploits a three-level hierarchy of both discrete and continuous latent varia…

    Submitted 14 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  13. arXiv:2112.04910  [pdf, other]

    cs.RO cs.CV

    Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings

    Authors: Mel Vecerik, Jackie Kay, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: Dense object tracking, the ability to localize specific object points with pixel-level accuracy, is an important computer vision task with numerous downstream applications in robotics. Existing approaches either compute dense keypoint embeddings in a single forward pass, meaning the model is trained to track everything at once, or allocate their full capacity to a sparse predefined set of points,…

    Submitted 13 December, 2021; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Supplementary material available at: https://sites.google.com/view/2021-tack

  14. arXiv:2110.06192  [pdf, other]

    cs.RO cs.LG

    Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

    Authors: Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

    Abstract: We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can ef…

    Submitted 3 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: CoRL 2021. Video: https://dpmd.ai/robotics-stacking-YT . Blog: https://dpmd.ai/robotics-stacking . Code: https://github.com/deepmind/rgb_stacking

  15. arXiv:2107.08881  [pdf, other]

    cs.LG cs.AI stat.ML

    Reasoning-Modulated Representations

    Authors: Petar Veličković, Matko Bošnjak, Thomas Kipf, Alexander Lerchner, Raia Hadsell, Razvan Pascanu, Charles Blundell

    Abstract: Neural networks leverage robust internal representations in order to generalise. Learning them is difficult, and often requires a large training set that covers the data distribution densely. We study a common setting where our task is not purely opaque. Indeed, very often we may have access to information about the underlying system (e.g. that observations must obey certain laws of physics) that…

    Submitted 3 December, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: To appear at LoG 2022. 17 pages, 5 figures

  16. arXiv:2106.08796  [pdf, other]

    cs.RO cs.AI

    Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation

    Authors: Alex Church, John Lloyd, Raia Hadsell, Nathan F. Lepora

    Abstract: Simulation has recently become key for deep reinforcement learning to safely and efficiently acquire general and complex control policies from visual and proprioceptive inputs. Tactile information is not usually considered despite its direct relation to environment interaction. In this work, we present a suite of simulated environments tailored towards tactile robotics and reinforcement learning.…

    Submitted 31 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  17. arXiv:2106.08199  [pdf, other]

    cs.LG cs.RO

    On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

    Abstract: Many advances that have improved the robustness and efficiency of deep reinforcement learning (RL) algorithms can, in one way or another, be understood as introducing additional objectives or constraints in the policy optimization step. This includes ideas as far ranging as exploration bonuses, entropy regularization, and regularization toward teachers or data priors. Often, the task reward and au…

    Submitted 1 August, 2023; v1 submitted 15 June, 2021; originally announced June 2021.

  18. Skillful Precipitation Nowcasting using Deep Generative Models of Radar

    Authors: Suman Ravuri, Karel Lenc, Matthew Willson, Dmitry Kangin, Remi Lam, Piotr Mirowski, Megan Fitzsimons, Maria Athanassiadou, Sheleem Kashem, Sam Madge, Rachel Prudden, Amol Mandhane, Aidan Clark, Andrew Brock, Karen Simonyan, Raia Hadsell, Niall Robinson, Ellen Clancy, Alberto Arribas, Shakir Mohamed

    Abstract: Precipitation nowcasting, the high-resolution forecasting of precipitation up to two hours ahead, supports the real-world socio-economic needs of many sectors reliant on weather-dependent decision-making. State-of-the-art operational nowcasting methods typically advect precipitation fields with radar-based wind estimates, and struggle to capture important non-linear events such as convective initi…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 46 pages, 17 figures, 2 tables

  19. arXiv:2101.08616  [pdf, other]

    cs.RO

    Learning rich touch representations through cross-modal self-supervision

    Authors: Martina Zambelli, Yusuf Aytar, Francesco Visin, Yuxiang Zhou, Raia Hadsell

    Abstract: The sense of touch is fundamental in several manipulation tasks, but rarely used in robot manipulation. In this work we tackle the problem of learning rich touch features from cross-modal self-supervision. We evaluate them identifying objects and their properties in a few-shot classification setting. Two new datasets are introduced using a simulated anthropomorphic robotic hand equipped with tacti…

    Submitted 21 January, 2021; originally announced January 2021.

  20. arXiv:2009.14711  [pdf, other]

    cs.RO cs.CV cs.LG

    S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

    Authors: Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov, David Barker, Rugile Pevceviciute, Thomas Rothörl, Christopher Schuster, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

    Abstract: A robot's ability to act is fundamentally constrained by what it can perceive. Many existing approaches to visual representation learning utilize general-purpose training criteria, e.g. image reconstruction, smoothness in latent space, or usefulness for control, or else make use of large datasets annotated with specific features (bounding boxes, segmentations, etc.). However, both approaches often…

    Submitted 13 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 11 pages, supplementary material available at: https://sites.google.com/view/2020-s3k/home

  21. arXiv:2008.02646  [pdf, other]

    cs.RO cs.AI

    Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard

    Authors: Alex Church, John Lloyd, Raia Hadsell, Nathan F. Lepora

    Abstract: Artificial touch would seem well-suited for Reinforcement Learning (RL), since both paradigms rely on interaction with an environment. Here we propose a new environment and set of tasks to encourage development of tactile reinforcement learning: learning to type on a braille keyboard. Four tasks are proposed, progressing in difficulty from arrow to alphabet keys and from discrete to continuous act…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted in RAL and IROS 2020

  22. arXiv:2005.07513  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    A Distributional View on Multi-Objective Policy Optimization

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

    Abstract: Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units and/or scales, which can make it challenging for practitioners to express numerical preferences over objectives in their native units. In this paper we propose a novel algorithm for multi-objective reinforcement learning that enables setting desired preferences for obj…

    Submitted 15 May, 2020; originally announced May 2020.

  23. arXiv:1911.10866  [pdf, other]

    cs.LG stat.ML

    Disentangled Cumulants Help Successor Representations Transfer to New Tasks

    Authors: Christopher Grimm, Irina Higgins, Andre Barreto, Denis Teplyashin, Markus Wulfmeier, Tim Hertweck, Raia Hadsell, Satinder Singh

    Abstract: Biological intelligence can learn to solve many diverse tasks in a data efficient manner by re-using basic knowledge and skills from one task to another. Furthermore, many of such skills are acquired without explicit supervision in an intrinsically driven fashion. This is in contrast to the state-of-the-art reinforcement learning agents, which typically start learning each new task from scratch an…

    Submitted 25 November, 2019; originally announced November 2019.

  24. arXiv:1911.08363  [pdf, other]

    cs.AI cs.LG

    Attention-Privileged Reinforcement Learning

    Authors: Sasha Salter, Dushyant Rao, Markus Wulfmeier, Raia Hadsell, Ingmar Posner

    Abstract: Image-based Reinforcement Learning is known to suffer from poor sample efficiency and generalisation to unseen visuals such as distractors (task-independent aspects of the observation space). Visual domain randomisation encourages transfer by training over visual factors of variation that may be encountered in the target domain. This increases learning complexity, can negatively impact learning ra…

    Submitted 11 January, 2021; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Published at Conference on Robot Learning (CoRL) 2020

  25. arXiv:1910.14481  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Continual Unsupervised Representation Learning

    Authors: Dushyant Rao, Francesco Visin, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu, Raia Hadsell

    Abstract: Continual learning aims to improve the ability of modern learning systems to deal with non-stationary distributions, typically by attempting to learn a series of tasks sequentially. Prior art in the field has largely considered supervised or reinforcement learning tasks, and often assumes full knowledge of task labels and boundaries. In this work, we propose an approach (CURL) to tackle a more gen…

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  26. arXiv:1910.10593  [pdf, other]

    stat.ML cs.AI cs.DS cs.LG

    Neural Execution of Graph Algorithms

    Authors: Petar Veličković, Rex Ying, Matilde Padovano, Raia Hadsell, Charles Blundell

    Abstract: Graph Neural Networks (GNNs) are a powerful representational tool for solving problems on graph-structured inputs. In almost all cases so far, however, they have been applied to directly recovering a final solution from raw inputs, without explicit guidance on how to structure their problem-solving. Here, instead, we focus on learning in the space of algorithms: we train several state-of-the-art G…

    Submitted 15 January, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: To appear at ICLR 2020. 13 pages, 4 figures

  27. arXiv:1910.06764  [pdf, other]

    cs.LG cs.AI stat.ML

    Stabilizing Transformers for Reinforcement Learning

    Authors: Emilio Parisotto, H. Francis Song, Jack W. Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew M. Botvinick, Nicolas Heess, Raia Hadsell

    Abstract: Owing to their ability to both effectively integrate information over long time horizons and scale to massive amounts of data, self-attention architectures have recently shown breakthrough success in natural language processing (NLP), achieving state-of-the-art results in domains such as language modeling and machine translation. Harnessing the transformer's ability to process long time horizons o…

    Submitted 13 October, 2019; originally announced October 2019.

  28. arXiv:1909.00025  [pdf, other]

    cs.LG cs.NE stat.ML

    Meta-Learning with Warped Gradient Descent

    Authors: Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell

    Abstract: Learning an efficient update rule from data that promotes rapid learning of new tasks from the same distribution remains an open problem in meta-learning. Typically, previous works have approached this issue either by attempting to train a neural network that directly produces updates or by attempting to learn better initialisations or scaling factors for a gradient-based update rule. Both of thes…

    Submitted 18 February, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: 28 pages, 13 figures, 3 tables. Published as a conference paper at ICLR 2020

  29. arXiv:1903.08542  [pdf, other]

    cs.RO

    Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning

    Authors: Sandy H. Huang, Martina Zambelli, Jackie Kay, Murilo F. Martins, Yuval Tassa, Patrick M. Pilarski, Raia Hadsell

    Abstract: Robots must know how to be gentle when they need to interact with fragile objects, or when the robot itself is prone to wear and tear. We propose an approach that enables deep reinforcement learning to train policies that are gentle, both during exploration and task execution. In a reward-based learning environment, a natural approach involves augmenting the (task) reward with a penalty for non-ge…

    Submitted 20 March, 2019; originally announced March 2019.

  30. arXiv:1903.01292  [pdf, other]

    cs.AI cs.CV cs.RO

    The StreetLearn Environment and Dataset

    Authors: Piotr Mirowski, Andras Banki-Horvath, Keith Anderson, Denis Teplyashin, Karl Moritz Hermann, Mateusz Malinowski, Matthew Koichi Grimes, Karen Simonyan, Koray Kavukcuoglu, Andrew Zisserman, Raia Hadsell

    Abstract: Navigation is a rich and well-grounded problem domain that drives progress in many different areas of research: perception, planning, memory, exploration, and optimisation in particular. Historically these challenges have been separately considered and solutions built that rely on stationary datasets - for example, recorded trajectories through an environment. These datasets cannot be used for dec…

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: 13 pages, 6 figures, 4 tables. arXiv admin note: text overlap with arXiv:1804.00168

  31. arXiv:1903.00401  [pdf, other]

    cs.AI cs.CL cs.CV

    Learning To Follow Directions in Street View

    Authors: Karl Moritz Hermann, Mateusz Malinowski, Piotr Mirowski, Andras Banki-Horvath, Keith Anderson, Raia Hadsell

    Abstract: Navigating and understanding the real world remains a key challenge in machine learning and inspires a great variety of research in areas such as language grounding, planning, navigation and computer vision. We propose an instruction-following task that requires all of the above, and which combines the practicality of simulated environments with the challenges of ambiguous, noisy real world data.…

    Submitted 21 November, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

    Journal ref: AAAI 2020

  32. arXiv:1902.04623  [pdf, other]

    cs.RO

    Value constrained model-free continuous control

    Authors: Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell

    Abstract: The naive application of Reinforcement Learning algorithms to continuous control problems -- such as locomotion and manipulation -- often results in policies which rely on high-amplitude, high-frequency control signals, known colloquially as bang-bang control. Although such solutions may indeed maximize task reward, they can be unsuitable for real world systems. Bang-bang control may lead to incre…

    Submitted 12 February, 2019; originally announced February 2019.

  33. arXiv:1812.07252  [pdf, other]

    cs.RO cs.CV cs.LG

    Sim-to-Real via Sim-to-Sim: Data-efficient Robotic Grasping via Randomized-to-Canonical Adaptation Networks

    Authors: Stephen James, Paul Wohlhart, Mrinal Kalakrishnan, Dmitry Kalashnikov, Alex Irpan, Julian Ibarz, Sergey Levine, Raia Hadsell, Konstantinos Bousmalis

    Abstract: Real world data, especially in the domain of robotics, is notoriously costly to collect. One way to circumvent this can be to leverage the power of simulation to produce large amounts of labelled data. However, training models on simulated images does not readily transfer to real-world ones. Using domain adaptation methods to cross this "reality gap" requires a large amount of unlabelled real-worl…

    Submitted 21 July, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019)

  34. From pixels to percepts: Highly robust edge perception and contour following using deep learning and an optical biomimetic tactile sensor

    Authors: Nathan F. Lepora, Alex Church, Conrad De Kerckhove, Raia Hadsell, John Lloyd

    Abstract: Deep learning has the potential to have the impact on robot touch that it has had on robot vision. Optical tactile sensors act as a bridge between the subjects by allowing techniques from vision to be applied to touch. In this paper, we apply deep learning to an optical biomimetic tactile sensor, the TacTip, which images an array of papillae (pins) inside its sensing surface analogous to structure…

    Submitted 6 February, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: Accepted in RAL and ICRA 2019. N. Lepora and J. Lloyd contributed equally to this work

    Journal ref: IEEE Robotics and Automation Letters 4(2), 2101-2107, 2019

  35. arXiv:1807.05960  [pdf, other]

    cs.LG cs.CV stat.ML

    Meta-Learning with Latent Embedding Optimization

    Authors: Andrei A. Rusu, Dushyant Rao, Jakub Sygnowski, Oriol Vinyals, Razvan Pascanu, Simon Osindero, Raia Hadsell

    Abstract: Gradient-based meta-learning techniques are both widely applicable and proficient at solving challenging few-shot learning and fast adaptation problems. However, they have practical difficulties when operating on high-dimensional parameter spaces in extreme low-data regimes. We show that it is possible to bypass these limitations by learning a data-dependent latent generative representation of mod…

    Submitted 26 March, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

  36. arXiv:1807.05211  [pdf, other]

    cs.RO

    Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

    Authors: Jake Bruce, Niko Sünderhauf, Piotr Mirowski, Raia Hadsell, Michael Milford

    Abstract: Model-free reinforcement learning has recently been shown to be effective at learning navigation policies from complex image input. However, these algorithms tend to require large amounts of interaction with the environment, which can be prohibitively costly to obtain on robots in the real world. We present an approach for efficiently learning goal-directed navigation policies on a mobile robot, f…

    Submitted 11 July, 2018; originally announced July 2018.

  37. arXiv:1806.01242  [pdf, other]

    cs.LG cs.AI stat.ML

    Graph networks as learnable physics engines for inference and control

    Authors: Alvaro Sanchez-Gonzalez, Nicolas Heess, Jost Tobias Springenberg, Josh Merel, Martin Riedmiller, Raia Hadsell, Peter Battaglia

    Abstract: Understanding and interacting with everyday physical scenes requires rich knowledge about the structure of the world, represented either implicitly in a value or policy function, or explicitly in a transition model. Here we introduce a new class of learnable models--based on graph networks--which implement an inductive bias for object- and relation-centric representations of complex, dynamical sys…

    Submitted 4 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  38. arXiv:1805.06370  [pdf, other

    stat.ML cs.LG

    Progress & Compress: A scalable framework for continual learning

    Authors: Jonathan Schwarz, Jelena Luketina, Wojciech M. Czarnecki, Agnieszka Grabska-Barwinska, Yee Whye Teh, Razvan Pascanu, Raia Hadsell

    Abstract: We introduce a conceptually simple and scalable framework for continual learning domains where tasks are learned sequentially. Our method is constant in the number of parameters and is designed to preserve performance on previously encountered tasks while accelerating learning progress on subsequent problems. This is achieved by training a network with two components: A knowledge base, capable of… ▽ More

    Submitted 2 July, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: Accepted at ICML 2018

  39. arXiv:1804.06557  [pdf, other

    cs.RO

    The Limits and Potentials of Deep Learning for Robotics

    Authors: Niko Sünderhauf, Oliver Brock, Walter Scheirer, Raia Hadsell, Dieter Fox, Jürgen Leitner, Ben Upcroft, Pieter Abbeel, Wolfram Burgard, Michael Milford, Peter Corke

    Abstract: The application of deep learning in robotics leads to very specific problems and research questions that are typically not addressed by the computer vision and machine learning communities. In this paper we discuss a number of robotics-specific learning, reasoning, and embodiment challenges for deep learning. We explain the need for better evaluation metrics, highlight the importance and unique ch… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

  40. arXiv:1804.00168  [pdf, other

    cs.AI

    Learning to Navigate in Cities Without a Map

    Authors: Piotr Mirowski, Matthew Koichi Grimes, Mateusz Malinowski, Karl Moritz Hermann, Keith Anderson, Denis Teplyashin, Karen Simonyan, Koray Kavukcuoglu, Andrew Zisserman, Raia Hadsell

    Abstract: Navigating through unstructured environments is a basic capability of intelligent creatures, and thus is of fundamental interest in the study and development of artificial intelligence. Long-range navigation is a complex cognitive task that relies on developing an internal representation of space, grounded by recognisable landmarks and robust visual processing, that can simultaneously support cont… ▽ More

    Submitted 9 January, 2019; v1 submitted 31 March, 2018; originally announced April 2018.

    Comments: 17 pages, 16 figures, published at NeurIPS 2018

    Journal ref: Neural Information Processing Systems 2018

  41. arXiv:1802.09564  [pdf, other

    cs.RO cs.AI cs.LG

    Reinforcement and Imitation Learning for Diverse Visuomotor Skills

    Authors: Yuke Zhu, Ziyu Wang, Josh Merel, Andrei Rusu, Tom Erez, Serkan Cabi, Saran Tunyasuvunakool, János Kramár, Raia Hadsell, Nando de Freitas, Nicolas Heess

    Abstract: We propose a model-free deep reinforcement learning method that leverages a small amount of demonstration data to assist a reinforcement learning agent. We apply this approach to robotic manipulation tasks and train end-to-end visuomotor policies that map directly from RGB camera inputs to joint velocities. We demonstrate that our approach can solve a wide variety of visuomotor tasks, for which en… ▽ More

    Submitted 27 May, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: 13 pages, 6 figures, Published in RSS 2018

  42. arXiv:1711.10137  [pdf, other

    cs.AI cs.LG cs.RO

    One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

    Authors: Jake Bruce, Niko Suenderhauf, Piotr Mirowski, Raia Hadsell, Michael Milford

    Abstract: Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fi… ▽ More

    Submitted 28 November, 2017; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: NIPS Workshop on Acting and Interacting in the Real World: Challenges in Robot Learning

    Journal ref: Bruce, Jake, et al. "One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay." Proceedings of the NIPS Workshop on Acting and Interacting in the Real World: Challenges in Robot Learning. 2017

  43. arXiv:1707.04175  [pdf, other

    cs.LG stat.ML

    Distral: Robust Multitask Reinforcement Learning

    Authors: Yee Whye Teh, Victor Bapst, Wojciech Marian Czarnecki, John Quan, James Kirkpatrick, Raia Hadsell, Nicolas Heess, Razvan Pascanu

    Abstract: Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network parameters, where efficiency may be improved through transfer across related tasks. In practice, however, this is not usually observed, because gradients from d… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  44. arXiv:1612.00796  [pdf, other

    cs.LG cs.AI stat.ML

    Overcoming catastrophic forgetting in neural networks

    Authors: James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A. Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath, Dharshan Kumaran, Raia Hadsell

    Abstract: The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have… ▽ More

    Submitted 25 January, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

  45. arXiv:1611.03673  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    Learning to Navigate in Complex Environments

    Authors: Piotr Mirowski, Razvan Pascanu, Fabio Viola, Hubert Soyer, Andrew J. Ballard, Andrea Banino, Misha Denil, Ross Goroshin, Laurent Sifre, Koray Kavukcuoglu, Dharshan Kumaran, Raia Hadsell

    Abstract: Learning to navigate in complex environments with dynamic elements is an important milestone in developing AI agents. In this work we formulate the navigation question as a reinforcement learning problem and show that data efficiency and task performance can be dramatically improved by relying on additional auxiliary tasks leveraging multimodal sensory inputs. In particular we consider jointly lea… ▽ More

    Submitted 13 January, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: 11 pages, 5 appendix pages, 11 figures, 3 tables, under review as a conference paper at ICLR 2017

  46. arXiv:1610.04286  [pdf, other

    cs.RO cs.LG

    Sim-to-Real Robot Learning from Pixels with Progressive Nets

    Authors: Andrei A. Rusu, Mel Vecerik, Thomas Rothörl, Nicolas Heess, Razvan Pascanu, Raia Hadsell

    Abstract: Applying end-to-end learning to solve complex, interactive, pixel-driven control tasks on a robot is an unsolved problem. Deep Reinforcement Learning algorithms are too slow to achieve performance on a real robot, but their potential has been demonstrated in simulated environments. We propose using progressive networks to bridge the reality gap and transfer learned policies from simulation to the… ▽ More

    Submitted 22 May, 2018; v1 submitted 13 October, 2016; originally announced October 2016.

  47. arXiv:1606.04671  [pdf, other

    cs.LG

    Progressive Neural Networks

    Authors: Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu, Razvan Pascanu, Raia Hadsell

    Abstract: Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architec… ▽ More

    Submitted 22 October, 2022; v1 submitted 15 June, 2016; originally announced June 2016.

  48. arXiv:1511.06295  [pdf, other

    cs.LG

    Policy Distillation

    Authors: Andrei A. Rusu, Sergio Gomez Colmenarejo, Caglar Gulcehre, Guillaume Desjardins, James Kirkpatrick, Razvan Pascanu, Volodymyr Mnih, Koray Kavukcuoglu, Raia Hadsell

    Abstract: Policies for complex visual tasks have been successfully learned with deep reinforcement learning, using an approach called deep Q-networks (DQN), but relatively large (task-specific) networks and extensive training are needed to achieve good performance. In this work, we present a novel method called policy distillation that can be used to extract the policy of a reinforcement learning agent and… ▽ More

    Submitted 7 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Submitted to ICLR 2016