Showing 1–11 of 11 results for author: Manuelli, L

  1. arXiv:2307.04751  [pdf, other]

    cs.RO cs.CV cs.LG

    Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement

    Authors: Anthony Simeonov, Ankit Goyal, Lucas Manuelli, Lin Yen-Chen, Alina Sarmiento, Alberto Rodriguez, Pulkit Agrawal, Dieter Fox

    Abstract: We propose a system for rearranging objects in a scene to achieve a desired object-scene placing relationship, such as a book inserted in an open slot of a bookshelf. The pipeline generalizes to novel geometries, poses, and layouts of both scenes and objects, and is trained from demonstrations to operate directly on 3D point clouds. Our system overcomes challenges associated with the existence of…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Project page: https://anthonysimeonov.github.io/rpdiff-multi-modal/

  2. arXiv:2212.06870  [pdf, other]

    cs.CV cs.RO

    MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

    Authors: Yann Labbé, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic

    Abstract: We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii) a CAD model of the observed object. The contributions of this work are threefold. First, we present a 6D pose refiner based on a render&compare strategy which c…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: CoRL 2022

  3. arXiv:2209.05451  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

    Authors: Mohit Shridhar, Lucas Manuelli, Dieter Fox

    Abstract: Transformers have revolutionized vision and natural language processing with their ability to scale with large datasets. But in robotic manipulation, data is both limited and expensive. Can manipulation still benefit from Transformers with the right problem formulation? We investigate this question with PerAct, a language-conditioned behavior-cloning agent for multi-task 6-DoF manipulation. PerAct…

    Submitted 11 November, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: CoRL 2022. Project Website: https://peract.github.io/

  4. arXiv:2109.12098  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    CLIPort: What and Where Pathways for Robotic Manipulation

    Authors: Mohit Shridhar, Lucas Manuelli, Dieter Fox

    Abstract: How can we imbue robots with the ability to manipulate objects precisely but also to reason about them in terms of abstract concepts? Recent works in manipulation have shown that end-to-end networks can learn dexterous skills that require precise spatial reasoning, but these methods often fail to generalize to new goals or quickly learn transferable concepts across tasks. In parallel, there has be…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: CoRL 2021. Project Website: https://cliport.github.io/

  5. arXiv:2009.05085  [pdf, other]

    cs.RO

    Keypoints into the Future: Self-Supervised Correspondence in Model-Based Reinforcement Learning

    Authors: Lucas Manuelli, Yunzhu Li, Pete Florence, Russ Tedrake

    Abstract: Predictive models have been at the core of many robotic systems, from quadrotors to walking robots. However, it has been challenging to develop and apply such models to practical robotic manipulation due to high-dimensional sensory observations such as images. Previous approaches to learning models in the context of robotic manipulation have either learned whole image dynamics or used autoencoders…

    Submitted 10 September, 2020; originally announced September 2020.

  6. arXiv:1909.06933  [pdf, other]

    cs.RO cs.CV cs.LG

    Self-Supervised Correspondence in Visuomotor Policy Learning

    Authors: Peter Florence, Lucas Manuelli, Russ Tedrake

    Abstract: In this paper we explore using self-supervised correspondence for improving the generalization performance and sample efficiency of visuomotor policy learning. Prior work has primarily used approaches such as autoencoding, pose-based losses, and end-to-end policy optimization in order to train the visual portion of visuomotor policies. We instead propose an approach using self-supervised dense vis…

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: Video at: https://sites.google.com/view/visuomotor-correspondence

  7. arXiv:1903.06684  [pdf, other]

    cs.RO

    kPAM: KeyPoint Affordances for Category-Level Robotic Manipulation

    Authors: Lucas Manuelli, Wei Gao, Peter Florence, Russ Tedrake

    Abstract: We would like robots to achieve purposeful manipulation by placing any instance from a category of objects into a desired set of goal states. Existing manipulation pipelines typically specify the desired configuration as a target 6-DOF pose and rely on explicitly estimating the pose of the manipulated objects. However, representing an object with a parameterized transformation defined on a fixed t…

    Submitted 29 October, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: First two authors contributed equally. The video and supplemental material are available at https://sites.google.com/view/kpam

  8. arXiv:1806.08756  [pdf, other]

    cs.RO cs.CV cs.LG

    Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation

    Authors: Peter R. Florence, Lucas Manuelli, Russ Tedrake

    Abstract: What is the right object representation for manipulation? We would like robots to visually perceive scenes and learn an understanding of the objects in them that (i) is task-agnostic and can be used as a building block for a variety of manipulation tasks, (ii) is generally applicable to both rigid and non-rigid objects, (iii) takes advantage of the strong priors provided by 3D vision, and (iv) is…

    Submitted 7 September, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

  9. arXiv:1707.04796  [pdf, other]

    cs.CV cs.RO

    LabelFusion: A Pipeline for Generating Ground Truth Labels for Real RGBD Data of Cluttered Scenes

    Authors: Pat Marion, Peter R. Florence, Lucas Manuelli, Russ Tedrake

    Abstract: Deep neural network (DNN) architectures have been shown to outperform traditional pipelines for object segmentation and pose estimation using RGBD data, but the performance of these DNN pipelines is directly tied to how representative the training data is of the true data. Hence a key requirement for employing these methods in practice is to have a large set of labeled data for your specific robot…

    Submitted 26 September, 2017; v1 submitted 15 July, 2017; originally announced July 2017.

  10. arXiv:1207.3575  [pdf, ps, other]

    math.DS

    On Li-Yorke Measurable Sensitivity

    Authors: Jared Hallett, Lucas Manuelli, Cesar E. Silva

    Abstract: The notion of Li-Yorke sensitivity has been studied extensively in the case of topological dynamical systems. We introduce a measurable version of Li-Yorke sensitivity, for nonsingular (and measure-preserving) dynamical systems, and compare it with various mixing notions. It is known that in the case of nonsingular dynamical systems, ergodic Cartesian square implies double ergodicity, which in tur…

    Submitted 3 February, 2014; v1 submitted 16 July, 2012; originally announced July 2012.

    Comments: Corrected some statements in Section 6 (old Theorem 3); added references

    MSC Class: 37A40; 37A05

  11. Recurrent Partial Words

    Authors: Francine Blanchet-Sadri, Aleksandar Chakarov, Lucas Manuelli, Jarett Schwartz, Slater Stich

    Abstract: Partial words are sequences over a finite alphabet that may contain wildcard symbols, called holes, which match or are compatible with all letters; partial words without holes are said to be full words (or simply words). Given an infinite partial word w, the number of distinct full words over the alphabet that are compatible with factors of w of length n, called subwords of w, refers to a measure…

    Submitted 17 August, 2011; originally announced August 2011.

    Comments: In Proceedings WORDS 2011, arXiv:1108.3412

    Journal ref: EPTCS 63, 2011, pp. 71-82