Showing 1–8 of 8 results for author: Lee, A X

  1. arXiv:2306.11706  [pdf, other]

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de…

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  2. arXiv:2205.03353  [pdf, other]

    cs.RO cs.LG

    How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation

    Authors: Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis

    Abstract: Reinforcement learning (RL) has been shown to be effective at learning control from experience. However, RL typically requires a large amount of online interaction with the environment. This limits its applicability to real-world settings, such as in robotics, where such interaction is expensive. In this work we investigate ways to minimize online interactions in a target task, by reusing a subopt…

    Submitted 6 May, 2022; originally announced May 2022.

  3. arXiv:2110.06192  [pdf, other]

    cs.RO cs.LG

    Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

    Authors: Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

    Abstract: We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can ef…

    Submitted 3 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: CoRL 2021. Video: https://dpmd.ai/robotics-stacking-YT . Blog: https://dpmd.ai/robotics-stacking . Code: https://github.com/deepmind/rgb_stacking

  4. arXiv:1907.00953  [pdf, other]

    cs.LG cs.AI stat.ML

    Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

    Authors: Alex X. Lee, Anusha Nagabandi, Pieter Abbeel, Sergey Levine

    Abstract: Deep reinforcement learning (RL) algorithms can use high-capacity deep networks to learn directly from image observations. However, these high-dimensional observation spaces present a number of challenges in practice, since the policy must now solve two problems: representation learning and task learning. In this work, we tackle these two problems separately, by explicitly learning latent represen…

    Submitted 26 October, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Project website: https://alexlee-gk.github.io/slac/

  5. arXiv:1810.03043  [pdf, other]

    cs.RO cs.AI cs.CV

    Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning

    Authors: Frederik Ebert, Sudeep Dasari, Alex X. Lee, Sergey Levine, Chelsea Finn

    Abstract: Prediction is an appealing objective for self-supervised learning of behavioral skills, particularly for autonomous robots. However, effectively utilizing predictive models for control, especially with raw image inputs, poses a number of major challenges. How should the predictions be used? What happens when they are inaccurate? In this paper, we tackle these questions by proposing a method for le…

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: Accepted at the Conference on Robot Learning (CoRL) 2018

  6. arXiv:1804.01523  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Stochastic Adversarial Video Prediction

    Authors: Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine

    Abstract: Being able to predict what may happen in the future requires an in-depth understanding of the physical and causal rules that govern the world. A model that is able to do so has a number of appealing applications, from robotic planning to representation learning. However, learning to predict raw future observations, such as frames in a video, is exceedingly challenging -- the ambiguous nature of th…

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: Website: https://alexlee-gk.github.io/video_prediction/

  7. arXiv:1710.05268  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Self-Supervised Visual Planning with Temporal Skip Connections

    Authors: Frederik Ebert, Chelsea Finn, Alex X. Lee, Sergey Levine

    Abstract: In order to autonomously learn wide repertoires of complex skills, robots must be able to learn from their own autonomously collected data, without human supervision. One learning signal that is always available for autonomously collected data is prediction: if a robot can learn to predict the future, it can use this predictive model to take actions to produce desired outcomes, such as moving an o…

    Submitted 14 October, 2017; originally announced October 2017.

    Comments: Accepted at the Conference on Robot Learning (CoRL) 2017

  8. arXiv:1703.11000  [pdf, other]

    cs.LG cs.AI cs.RO

    Learning Visual Servoing with Deep Features and Fitted Q-Iteration

    Authors: Alex X. Lee, Sergey Levine, Pieter Abbeel

    Abstract: Visual servoing involves choosing actions that move a robot in response to observations from a camera, in order to reach a goal configuration in the world. Standard visual servoing approaches typically rely on manually designed features and analytical dynamics models, which limits their generalization capability and often requires extensive application-specific feature and model engineering. In th…

    Submitted 10 July, 2017; v1 submitted 31 March, 2017; originally announced March 2017.

    Comments: ICLR 2017