Skip to main content

Showing 1–13 of 13 results for author: Groth, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05546  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Actor-Critic Reinforcement Learning Scales to Large Models

    Authors: Jost Tobias Springenberg, Abbas Abdolmaleki, **gwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin Riedmiller

    Abstract: We show that offline actor-critic reinforcement learning can scale to large models - such as transformers - and follows similar scaling laws as supervised learning. We find that offline actor-critic algorithms can outperform strong, supervised, behavioral cloning baselines for multi-task training on a large dataset containing both sub-optimal and expert behavior on 132 continuous control tasks. We… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  2. arXiv:2312.11374  [pdf, other

    cs.RO

    Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

    Authors: Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, **gwei Zhang, Francesco Nori, Nicolas Heess, Martin Riedmiller

    Abstract: Reinforcement learning solely from an agent's self-generated data is often believed to be infeasible for learning on real robots, due to the amount of data needed. However, if done right, agents learning from real data can be surprisingly efficient through re-using previously collected sub-optimal data. In this paper we demonstrate how the increased understanding of off-policy learning methods and… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2306.11706  [pdf, other

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz , et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de… ▽ More

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  4. arXiv:2305.01521  [pdf, other

    cs.LG stat.ML

    Unlocking the Power of Representations in Long-term Novelty-based Exploration

    Authors: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, Bilal Piot

    Abstract: We introduce Robust Exploration via Clustering-based Online Density Estimation (RECODE), a non-parametric method for novelty-based exploration that estimates visitation counts for clusters of states based on their similarity in a chosen embedding space. By adapting classical clustering to the nonstationary setting of Deep RL, RECODE can efficiently track state visitation counts over thousands of e… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  5. arXiv:2109.08603  [pdf, other

    cs.LG cs.NE cs.RO

    Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration

    Authors: Oliver Groth, Markus Wulfmeier, Giulia Vezzani, Vibhavari Dasagi, Tim Hertweck, Roland Hafner, Nicolas Heess, Martin Riedmiller

    Abstract: Curiosity-based reward schemes can present powerful exploration mechanisms which facilitate the discovery of solutions for complex, sparse or long-horizon tasks. However, as the agent learns to reach previously unexplored spaces and the objective adapts to reward new areas, many behaviours emerge only to disappear due to being overwritten by the constantly shifting objective. We argue that merely… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 14 pages, 7 figures, 2 tables

    ACM Class: I.2.6; I.2.9

  6. arXiv:2007.01272  [pdf, other

    cs.CV

    RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

    Authors: Sebastien Ehrhardt, Oliver Groth, Aron Monszpart, Martin Engelcke, Ingmar Posner, Niloy Mitra, Andrea Vedaldi

    Abstract: We present RELATE, a model that learns to generate physically plausible scenes and videos of multiple interacting objects. Similar to other generative approaches, RELATE is trained end-to-end on raw, unlabeled data. RELATE combines an object-centric GAN formulation with a model that explicitly accounts for correlations between individual objects. This allows the model to generate realistic scenes… ▽ More

    Submitted 9 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  7. arXiv:2003.08854  [pdf, other

    cs.RO cs.CV cs.LG

    Goal-Conditioned End-to-End Visuomotor Control for Versatile Skill Primitives

    Authors: Oliver Groth, Chia-Man Hung, Andrea Vedaldi, Ingmar Posner

    Abstract: Visuomotor control (VMC) is an effective means of achieving basic manipulation tasks such as pushing or pick-and-place from raw images. Conditioning VMC on desired goal states is a promising way of achieving versatile skill primitives. However, common conditioning schemes either rely on task-specific fine tuning - e.g. using one-shot imitation learning (IL) - or on sampling approaches using a forw… ▽ More

    Submitted 24 September, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: revised manuscript with additional baselines and generalisation experiments; 11 pages, 8 figures, 7 tables

    ACM Class: I.2.9; I.2.10

  8. arXiv:1909.13561  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Imagine That! Leveraging Emergent Affordances for 3D Tool Synthesis

    Authors: Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: In this paper we explore the richness of information captured by the latent space of a vision-based generative model. The model combines unsupervised generative learning with a task-based performance predictor to learn and to exploit task-relevant object affordances given visual observations from a reaching task, involving a scenario and a stick-like tool. While the learned embedding of the genera… ▽ More

    Submitted 7 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 12 pages, 6 figures

    ACM Class: I.2.10; I.2.6

  9. arXiv:1806.05502  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Scrutinizing and De-Biasing Intuitive Physics with Neural Stethoscopes

    Authors: Fabian B. Fuchs, Oliver Groth, Adam R. Kosiorek, Alex Bewley, Markus Wulfmeier, Andrea Vedaldi, Ingmar Posner

    Abstract: Visually predicting the stability of block towers is a popular task in the domain of intuitive physics. While previous work focusses on prediction accuracy, a one-dimensional performance measure, we provide a broader analysis of the learned physical understanding of the final model and how the learning process can be guided. To this end, we introduce neural stethoscopes as a general purpose framew… ▽ More

    Submitted 6 September, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

  10. arXiv:1804.08018  [pdf, other

    cs.CV

    ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking

    Authors: Oliver Groth, Fabian B. Fuchs, Ingmar Posner, Andrea Vedaldi

    Abstract: Physical intuition is pivotal for intelligent agents to perform complex tasks. In this paper we investigate the passive acquisition of an intuitive understanding of physical principles as well as the active utilisation of this intuition in the context of generalised object stacking. To this end, we provide: a simulation-based dataset featuring 20,000 stack configurations composed of a variety of e… ▽ More

    Submitted 6 July, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: revised version to appear at ECCV 2018

  11. Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

    Authors: Omid Hosseini Jafari, Oliver Groth, Alexander Kirillov, Michael Ying Yang, Carsten Rother

    Abstract: This paper addresses the task of designing a modular neural network architecture that jointly solves different tasks. As an example we use the tasks of depth estimation and semantic segmentation given a single RGB image. The main focus of this work is to analyze the cross-modality influence between depth and semantic prediction maps on their joint refinement. While most previous works solely focus… ▽ More

    Submitted 26 February, 2017; originally announced February 2017.

    Comments: Accepted to ICRA 2017

  12. arXiv:1602.07332  [pdf, other

    cs.CV cs.AI

    Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

    Authors: Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A. Shamma, Michael S. Bernstein, Fei-Fei Li

    Abstract: Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle the rich content in images for cognitive tasks are still being trained using the same datasets designe… ▽ More

    Submitted 23 February, 2016; originally announced February 2016.

    Comments: 44 pages, 37 figures

  13. arXiv:1511.03416  [pdf, other

    cs.CV cs.LG cs.NE

    Visual7W: Grounded Question Answering in Images

    Authors: Yuke Zhu, Oliver Groth, Michael Bernstein, Li Fei-Fei

    Abstract: We have seen great progress in basic perceptual tasks such as object recognition and detection. However, AI models still fail to match humans in high-level vision tasks due to the lack of capacities for deeper reasoning. Recently the new task of visual question answering (QA) has been proposed to evaluate a model's capacity for deep image understanding. Previous works have established a loose, glo… ▽ More

    Submitted 9 April, 2016; v1 submitted 11 November, 2015; originally announced November 2015.

    Comments: CVPR 2016