
Showing 1–16 of 16 results for author: Večerík, M

  1. arXiv:2404.13478  [pdf, other]

    cs.RO cs.CV cs.LG

    Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

    Authors: Ben Eisner, Yi Yang, Todor Davchev, Mel Vecerik, Jonathan Scholz, David Held

    Abstract: Many robot manipulation tasks can be framed as geometric reasoning tasks, where an agent must be able to precisely manipulate an object into a position that satisfies the task from a set of initial conditions. Often, task success is defined based on the relationship between two objects - for instance, hanging a mug on a rack. In such cases, the solution should be equivariant to the initial positio…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Published at International Conference on Learning Representations (ICLR 2024)

  2. arXiv:2308.15975  [pdf, other]

    cs.RO cs.AI cs.CV

    RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

    Authors: Mel Vecerik, Carl Doersch, Yi Yang, Todor Davchev, Yusuf Aytar, Guangyao Zhou, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: For robots to be useful outside labs and specialized factories we need a way to teach them new useful behaviors quickly. Current approaches lack either the generality to onboard new tasks without task-specific engineering, or else lack the data-efficiency to do so in an amount of time that enables practical use. In this work we explore dense tracking as a representational vehicle to allow faster a…

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Project website: https://robotap.github.io

  3. arXiv:2306.08637  [pdf, other]

    cs.CV

    TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement

    Authors: Carl Doersch, Yi Yang, Mel Vecerik, Dilara Gokay, Ankush Gupta, Yusuf Aytar, Joao Carreira, Andrew Zisserman

    Abstract: We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried point on any physical surface throughout a video sequence. Our approach employs two stages: (1) a matching stage, which independently locates a suitable candidate point match for the query point on every other frame, and (2) a refinement stage, which updates both the trajectory and query features based on loc…

    Submitted 30 August, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Published at ICCV 2023

  4. arXiv:2112.04910  [pdf, other]

    cs.RO cs.CV

    Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings

    Authors: Mel Vecerik, Jackie Kay, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: Dense object tracking, the ability to localize specific object points with pixel-level accuracy, is an important computer vision task with numerous downstream applications in robotics. Existing approaches either compute dense keypoint embeddings in a single forward pass, meaning the model is trained to track everything at once, or allocate their full capacity to a sparse predefined set of points,…

    Submitted 13 December, 2021; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Supplementary material available at: https://sites.google.com/view/2021-tack

  5. arXiv:2103.11512  [pdf, other]

    cs.AI cs.RO

    Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study

    Authors: Jianlan Luo, Oleg Sushkov, Rugile Pevceviciute, Wenzhao Lian, Chang Su, Mel Vecerik, Ning Ye, Stefan Schaal, Jon Scholz

    Abstract: Over the past several years there has been a considerable research investment into learning-based approaches to industrial assembly, but despite significant progress these techniques have yet to be adopted by industry. We argue that it is the prohibitively large design space for Deep Reinforcement Learning (DRL), rather than algorithmic limitations per se, that is truly responsible for this lack…

    Submitted 31 July, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: RSS 2021

  6. arXiv:2009.14711  [pdf, other]

    cs.RO cs.CV cs.LG

    S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

    Authors: Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov, David Barker, Rugile Pevceviciute, Thomas Rothörl, Christopher Schuster, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

    Abstract: A robot's ability to act is fundamentally constrained by what it can perceive. Many existing approaches to visual representation learning utilize general-purpose training criteria, e.g. image reconstruction, smoothness in latent space, or usefulness for control, or else make use of large datasets annotated with specific features (bounding boxes, segmentations, etc.). However, both approaches often…

    Submitted 13 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 11 pages, supplementary material available at: https://sites.google.com/view/2020-s3k/home

  7. arXiv:1911.06833  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient

    Authors: Kevin Sebastian Luck, Mel Vecerik, Simon Stepputtis, Heni Ben Amor, Jonathan Scholz

    Abstract: Model-free reinforcement learning algorithms such as Deep Deterministic Policy Gradient (DDPG) often require additional exploration strategies, especially if the actor is deterministic. This work evaluates model-based trajectory optimization methods for exploration in Deep Deterministic Policy Gradient when trained on a latent image embedding. In addition, an extension of…

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Accepted for IROS 2019

  8. arXiv:1909.12200  [pdf, other]

    cs.RO cs.LG

    Scaling data-driven robotics with reward sketching and batch reinforcement learning

    Authors: Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

    Abstract: We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions. We show how to apply this framework to accomplish three different object manipulation tasks on a real robot platform. Given demonstrations of a task together with task-agnostic recorded experience, we use a special form of human…

    Submitted 4 June, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Project website: https://sites.google.com/view/data-driven-robotics/

    Journal ref: Robotics: Science and Systems Conference 2020

  9. arXiv:1904.01139  [pdf, other]

    cs.LG stat.ML

    Generative predecessor models for sample-efficient imitation learning

    Authors: Yannick Schroecker, Mel Vecerik, Jonathan Scholz

    Abstract: We propose Generative Predecessor Models for Imitation Learning (GPRIL), a novel imitation learning algorithm that matches the state-action distribution to the distribution observed in expert demonstrations, using generative models to reason probabilistically about alternative histories of demonstrated states. We show that this approach allows an agent to learn robust policies using only a small n…

    Submitted 1 April, 2019; originally announced April 2019.

  10. arXiv:1810.01531  [pdf, other]

    cs.RO

    A Practical Approach to Insertion with Variable Socket Position Using Deep Reinforcement Learning

    Authors: Mel Vecerik, Oleg Sushkov, David Barker, Thomas Rothörl, Todd Hester, Jon Scholz

    Abstract: Insertion is a challenging haptic and visual control problem with significant practical value for manufacturing. Existing approaches in the model-based robotics community can be highly effective when task geometry is known, but are complex and cumbersome to implement, and must be tailored to each individual problem by a qualified engineer. Within the learning community there is a long history of i…

    Submitted 8 October, 2018; v1 submitted 2 October, 2018; originally announced October 2018.

  11. arXiv:1805.11593  [pdf, other]

    cs.LG cs.AI stat.ML

    Observe and Look Further: Achieving Consistent Performance on Atari

    Authors: Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin

    Abstract: Despite significant advances in the field of deep Reinforcement Learning (RL), today's algorithms still fail to learn human-level policies consistently over a set of diverse tasks such as Atari 2600 games. We identify three key challenges that any algorithm needs to master in order to perform well on all games: processing diverse reward distributions, reasoning over long time horizons, and explori…

    Submitted 29 May, 2018; originally announced May 2018.

  12. arXiv:1801.08757  [pdf, other]

    cs.AI

    Safe Exploration in Continuous Action Spaces

    Authors: Gal Dalal, Krishnamurthy Dvijotham, Matej Vecerik, Todd Hester, Cosmin Paduraru, Yuval Tassa

    Abstract: We address the problem of deploying a reinforcement learning (RL) agent on a physical system such as a datacenter cooling unit or robot, where critical constraints must never be violated. We show how to exploit the typically smooth dynamics of these systems and enable RL algorithms to never violate constraints during learning. Our technique is to directly add to the policy a safety layer that anal…

    Submitted 26 January, 2018; originally announced January 2018.

  13. arXiv:1707.08817  [pdf, other]

    cs.AI

    Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

    Authors: Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

    Abstract: We propose a general and model-free approach for Reinforcement Learning (RL) on real robotics with sparse rewards. We build upon the Deep Deterministic Policy Gradient (DDPG) algorithm to use demonstrations. Both demonstrations and actual interactions are used to fill a replay buffer and the sampling ratio between demonstrations and transitions is automatically tuned via a prioritized replay mecha…

    Submitted 8 October, 2018; v1 submitted 27 July, 2017; originally announced July 2017.

  14. arXiv:1704.03732  [pdf, ps, other]

    cs.AI cs.LG

    Deep Q-learning from Demonstrations

    Authors: Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

    Abstract: Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance during learning can be extremely poor. This may be acceptable for a simulator, but it severely limits the applicability of deep RL to many real-world…

    Submitted 22 November, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Published at AAAI 2018. Previously on arxiv as "Learning from Demonstrations for Real World Reinforcement Learning"

  15. arXiv:1704.03073  [pdf, other]

    cs.LG cs.RO

    Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

    Authors: Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin Riedmiller

    Abstract: Deep learning and reinforcement learning methods have recently been used to solve a variety of problems in continuous control domains. An obvious application of these techniques is dexterous manipulation tasks in robotics which are difficult to solve using traditional control theory or hand-engineered approaches. One example of such a task is to grasp an object and precisely stack it on another. S…

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: 12 pages, 5 Figures

  16. arXiv:1610.04286  [pdf, other]

    cs.RO cs.LG

    Sim-to-Real Robot Learning from Pixels with Progressive Nets

    Authors: Andrei A. Rusu, Mel Vecerik, Thomas Rothörl, Nicolas Heess, Razvan Pascanu, Raia Hadsell

    Abstract: Applying end-to-end learning to solve complex, interactive, pixel-driven control tasks on a robot is an unsolved problem. Deep Reinforcement Learning algorithms are too slow to achieve performance on a real robot, but their potential has been demonstrated in simulated environments. We propose using progressive networks to bridge the reality gap and transfer learned policies from simulation to the…

    Submitted 22 May, 2018; v1 submitted 13 October, 2016; originally announced October 2016.