Skip to main content

Showing 1–50 of 56 results for author: Platt, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15677  [pdf, other

    cs.RO

    Open-vocabulary Pick and Place via Patch-level Semantic Maps

    Authors: Mingxi Jia, Haojie Huang, Zhewen Zhang, Chenghao Wang, Linfeng Zhao, Dian Wang, Jason Xinyu Liu, Robin Walters, Robert Platt, Stefanie Tellex

    Abstract: Controlling robots through natural language instructions in open-vocabulary scenarios is pivotal for enhancing human-robot collaboration and complex robot behavior synthesis. However, achieving this capability poses significant challenges due to the need for a system that can generalize from limited data to a wide range of tasks and environments. Existing methods rely on large, costly datasets and… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.13961  [pdf, other

    cs.LG cs.RO

    Equivariant Offline Reinforcement Learning

    Authors: Arsh Tangri, Ondrej Biza, Dian Wang, David Klee, Owen Howell, Robert Platt

    Abstract: Sample efficiency is critical when applying learning-based methods to robotic manipulation due to the high cost of collecting expert demonstrations and the challenges of on-robot policy learning through online Reinforcement Learning (RL). Offline RL addresses this issue by enabling policy learning from an offline dataset collected using any behavioral policy, regardless of its quality. However, re… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.11740  [pdf, other

    cs.RO cs.AI cs.LG

    Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

    Authors: Haojie Huang, Karl Schmeckpeper, Dian Wang, Ondrej Biza, Yaoyao Qian, Haotian Liu, Mingxi Jia, Robert Platt, Robin Walters

    Abstract: Humans can imagine goal states during planning and perform actions to match those goals. In this work, we propose Imagination Policy, a novel multi-task key-frame policy network for solving high-precision pick and place tasks. Instead of learning actions directly, Imagination Policy generates point clouds to imagine desired states which are then translated to actions using rigid action estimation.… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2403.17757  [pdf, other

    cs.CV cs.LG

    Noise2Noise Denoising of CRISM Hyperspectral Data

    Authors: Robert Platt, Rossella Arcucci, Cédric M. John

    Abstract: Hyperspectral data acquired by the Compact Reconnaissance Imaging Spectrometer for Mars (CRISM) have allowed for unparalleled map** of the surface mineralogy of Mars. Due to sensor degradation over time, a significant portion of the recently acquired data is considered unusable. Here a new data-driven model architecture, Noise2Noise4Mars (N2N4M), is introduced to remove noise from CRISM images.… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures. Accepted as a conference paper at the ICLR 2024 ML4RS Workshop

  5. arXiv:2401.12046  [pdf, other

    cs.RO cs.LG

    Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D

    Authors: Haojie Huang, Owen Howell, Dian Wang, Xupeng Zhu, Robin Walters, Robert Platt

    Abstract: Many complex robotic manipulation tasks can be decomposed as a sequence of pick and place actions. Training a robotic agent to learn this sequence over many different starting conditions typically requires many iterations or demonstrations, especially in 3D environments. In this work, we propose Fourier Transporter (FourTran) which leverages the two-fold SE(d)xSE(d) symmetry in the pick-place prob… ▽ More

    Submitted 15 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  6. arXiv:2308.14670  [pdf, other

    cs.RO

    Symmetric Models for Visual Force Policy Learning

    Authors: Colin Kohler, Anuj Shrivatsav Srikanth, Eshan Arora, Robert Platt

    Abstract: While it is generally acknowledged that force feedback is beneficial to robotic control, applications of policy learning to robotic manipulation typically only leverage visual feedback. Recently, symmetric neural models have been used to significantly improve the sample efficiency and performance of policy learning across a variety of robotic manipulation domains. This paper explores an applicatio… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  7. arXiv:2308.07948  [pdf, other

    cs.RO cs.AI cs.CV

    Leveraging Symmetries in Pick and Place

    Authors: Haojie Huang, Dian Wang, Arsh Tangri, Robin Walters, Robert Platt

    Abstract: Robotic pick and place tasks are symmetric under translations and rotations of both the object to be picked and the desired place pose. For example, if the pick object is rotated or translated, then the optimal pick action should also rotate or translate. The same is true for the place pose; if the desired place pose changes, then the place action should also transform accordingly. A recently prop… ▽ More

    Submitted 22 December, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: International Journal of Robotics Research. arXiv admin note: substantial text overlap with arXiv:2202.09400

  8. arXiv:2306.12392  [pdf, other

    cs.RO cs.LG

    One-shot Imitation Learning via Interaction War**

    Authors: Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt

    Abstract: Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction War**, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D mesh of each object in the environment using shape war**, a technique for aligning point clouds across object instances. Then, we represent manipulation actio… ▽ More

    Submitted 4 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: CoRL 2023

  9. arXiv:2306.06489  [pdf, other

    cs.RO cs.AI

    On Robot Grasp Learning Using Equivariant Models

    Authors: Xupeng Zhu, Dian Wang, Guanang Su, Ondrej Biza, Robin Walters, Robert Platt

    Abstract: Real-world grasp detection is challenging due to the stochasticity in grasp dynamics and the noise in hardware. Ideally, the system would adapt to the real world by training directly on physical systems. However, this is generally difficult due to the large amount of training data required by most grasp learning models. In this paper, we note that the planar grasp function is $\SE(2)$-equivariant… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted in Autonomous Robot. arXiv admin note: substantial text overlap with arXiv:2202.09468

  10. arXiv:2303.04745  [pdf, other

    cs.LG stat.ML

    A General Theory of Correct, Incorrect, and Extrinsic Equivariance

    Authors: Dian Wang, Xupeng Zhu, Jung Yeon Park, Mingxi Jia, Guanang Su, Robert Platt, Robin Walters

    Abstract: Although equivariant machine learning has proven effective at many tasks, success depends heavily on the assumption that the ground truth function is symmetric over the entire domain matching the symmetry in an equivariant neural network. A missing piece in the equivariant learning literature is the analysis of equivariant networks when symmetry exists only partially in the domain. In this work, w… ▽ More

    Submitted 28 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at NeurIPS 2023

  11. arXiv:2302.13926  [pdf, other

    cs.CV

    Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction

    Authors: David M. Klee, Ondrej Biza, Robert Platt, Robin Walters

    Abstract: Predicting the pose of objects from a single image is an important but difficult computer vision problem. Methods that predict a single point estimate do not predict the pose of objects with symmetries well and cannot represent uncertainty. Alternatively, some works predict a distribution over orientations in $\mathrm{SO}(3)$. However, training such models can be computation- and sample-inefficien… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  12. arXiv:2211.09231  [pdf, other

    cs.LG cs.RO

    The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry

    Authors: Dian Wang, Jung Yeon Park, Neel Sortur, Lawson L. S. Wong, Robin Walters, Robert Platt

    Abstract: Extensive work has demonstrated that equivariant neural networks can significantly improve sample efficiency and generalization by enforcing an inductive bias in the network architecture. These applications typically assume that the domain symmetry is fully described by explicit transformations of the model inputs and outputs. However, many real-life applications contain only latent or partial sym… ▽ More

    Submitted 10 February, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Published at ICLR 2023, notable top 25% (Spotlight)

  13. arXiv:2211.04895  [pdf, ps, other

    cs.RO

    Grasp Learning: Models, Methods, and Performance

    Authors: Robert Platt

    Abstract: Grasp learning has become an exciting and important topic in robotics. Just a few years ago, the problem of gras** novel objects from unstructured piles of clutter was considered a serious research challenge. Now, it is a capability that is quickly becoming incorporated into industrial supply chain automation. How did that happen? What is the current state of the art in robotic grasp learning, w… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  14. arXiv:2211.01991  [pdf, other

    cs.RO cs.LG

    Leveraging Fully Observable Policies for Learning under Partial Observability

    Authors: Hai Nguyen, Andrea Baisero, Dian Wang, Christopher Amato, Robert Platt

    Abstract: Reinforcement learning in partially observable domains is challenging due to the lack of observable state information. Thankfully, learning offline in a simulator with such state information is often possible. In particular, we propose a method for partially observable reinforcement learning that uses a fully observable policy (which we call a state expert) during offline training to improve onlin… ▽ More

    Submitted 10 November, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Accepted at the 2022 Conference on Robot Learning (CoRL), Auckland, New Zealand

  15. arXiv:2211.00194  [pdf, other

    cs.RO

    SEIL: Simulation-augmented Equivariant Imitation Learning

    Authors: Mingxi Jia, Dian Wang, Guanang Su, David Klee, Xupeng Zhu, Robin Walters, Robert Platt

    Abstract: In robotic manipulation, acquiring samples is extremely expensive because it often requires interacting with the real world. Traditional image-level data augmentation has shown the potential to improve sample efficiency in various machine learning tasks. However, image-level data augmentation is insufficient for an imitation learning agent to learn good manipulation policies in a reasonable amount… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

  16. arXiv:2211.00191  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Edge Grasp Network: A Graph-Based SE(3)-invariant Approach to Grasp Detection

    Authors: Haojie Huang, Dian Wang, Xupeng Zhu, Robin Walters, Robert Platt

    Abstract: Given point cloud input, the problem of 6-DoF grasp pose detection is to identify a set of hand poses in SE(3) from which an object can be successfully grasped. This important problem has many practical applications. Here we propose a novel method and neural network model that enables better grasp success rates relative to what is available in the literature. The method takes standard point cloud… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: https://haojhuang.github.io/edge_grasp_page/

  17. arXiv:2207.11313  [pdf, other

    cs.RO

    Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks

    Authors: David Klee, Ondrej Biza, Robert Platt

    Abstract: Multi-goal policy learning for robotic manipulation is challenging. Prior successes have used state-based representations of the objects or provided demonstration data to facilitate learning. In this paper, by hand-coding a high-level discrete representation of the domain, we show that policies to reach dozens of goals can be learned with a single network using Q-learning from pixels. The agent fo… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  18. arXiv:2207.08925  [pdf, other

    cs.CV cs.LG

    Image to Icosahedral Projection for $\mathrm{SO}(3)$ Object Reasoning from Single-View Images

    Authors: David Klee, Ondrej Biza, Robert Platt, Robin Walters

    Abstract: Reasoning about 3D objects based on 2D images is challenging due to variations in appearance caused by viewing the object from different orientations. Tasks such as object classification are invariant to 3D rotations and other such as pose estimation are equivariant. However, imposing equivariance as a model constraint is typically not possible with 2D image input because we do not have an a prior… ▽ More

    Submitted 15 November, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

  19. arXiv:2206.14802  [pdf, other

    cs.RO cs.AI cs.LG

    Visual Foresight With a Local Dynamics Model

    Authors: Colin Kohler, Robert Platt

    Abstract: Model-free policy learning has been shown to be capable of learning manipulation policies which can solve long-time horizon tasks using single-step manipulation primitives. However, training these policies is a time-consuming process requiring large amounts of data. We propose the Local Dynamics Model (LDM) which efficiently learns the state-transition function for these manipulation primitives. B… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  20. arXiv:2206.01078  [pdf, other

    cs.LG cs.AI

    Deep Transformer Q-Networks for Partially Observable Reinforcement Learning

    Authors: Kevin Esslinger, Robert Platt, Christopher Amato

    Abstract: Real-world reinforcement learning tasks often involve some form of partial observability where the observations only give a partial or noisy view of the true state of the world. Such tasks typically require some form of memory, where the agent has access to multiple past observations, in order to perform well. One popular way to incorporate memory is by using a recurrent neural network to access t… ▽ More

    Submitted 10 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  21. arXiv:2205.14292  [pdf, other

    cs.RO

    BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework

    Authors: Dian Wang, Colin Kohler, Xupeng Zhu, Mingxi Jia, Robert Platt

    Abstract: We present BulletArm, a novel benchmark and learning-environment for robotic manipulation. BulletArm is designed around two key principles: reproducibility and extensibility. We aim to encourage more direct comparisons between robotic learning methods by providing a set of standardized benchmark tasks in simulation alongside a collection of baseline algorithms. The framework consists of 31 differe… ▽ More

    Submitted 17 October, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Published at ISRR 2022

  22. arXiv:2204.13022  [pdf, other

    cs.LG

    Binding Actions to Objects in World Models

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong, Thomas Kipf

    Abstract: We study the problem of binding actions to objects in object-factored world models using action-attention mechanisms. We propose two attention mechanisms for binding actions to objects, soft attention and hard attention, which we evaluate in the context of structured world models for five environments. Our experiments show that hard attention helps contrastively-trained structured world models to… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Published at the ICLR 2022 workshop on Objects, Structure and Causality

  23. Efficient and Accurate Candidate Generation for Grasp Pose Detection in SE(3)

    Authors: Andreas ten Pas, Colin Keil, Robert Platt

    Abstract: Grasp detection of novel objects in unstructured environments is a key capability in robotic manipulation. For 2D grasp detection problems where grasps are assumed to lie in the plane, it is common to design a fully convolutional neural network that predicts grasps over an entire image in one step. However, this is not possible for grasp pose detection where grasp poses are assumed to exist in SE(… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

  24. arXiv:2204.00898  [pdf, other

    cs.RO

    Hierarchical Reinforcement Learning under Mixed Observability

    Authors: Hai Nguyen, Zhihan Yang, Andrea Baisero, Xiao Ma, Robert Platt, Christopher Amato

    Abstract: The framework of mixed observable Markov decision processes (MOMDP) models many robotic domains in which some state variables are fully observable while others are not. In this work, we identify a significant subclass of MOMDPs defined by how actions influence the fully observable components of the state and how those, in turn, influence the partially observable components and the rewards. This un… ▽ More

    Submitted 4 June, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Accepted at the 15th International Workshop on the Algorithmic Foundations of Robotics (WAFR) 2022, University of Maryland, College Park. The first two authors contributed equally

  25. arXiv:2203.10685  [pdf, other

    cs.RO

    Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation

    Authors: Tarik Kelestemur, Robert Platt, Taskin Padir

    Abstract: Object pose estimation methods allow finding locations of objects in unstructured environments. This is a highly desired skill for autonomous robot manipulation as robots need to estimate the precise poses of the objects in order to manipulate them. In this paper, we investigate the problems of tactile pose estimation and manipulation for category-level objects. Our proposed method uses a Bayes fi… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted atthe 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022)

  26. arXiv:2203.04923  [pdf, other

    cs.RO

    On-Robot Learning With Equivariant Models

    Authors: Dian Wang, Mingxi Jia, Xupeng Zhu, Robin Walters, Robert Platt

    Abstract: Recently, equivariant neural network models have been shown to improve sample efficiency for tasks in computer vision and reinforcement learning. This paper explores this idea in the context of on-robot policy learning in which a policy must be learned entirely on a physical robotic system without reference to a model, a simulator, or an offline dataset. We focus on applications of Equivariant SAC… ▽ More

    Submitted 17 October, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Published at CoRL 2022

  27. arXiv:2203.04439  [pdf, other

    cs.RO

    $\mathrm{SO}(2)$-Equivariant Reinforcement Learning

    Authors: Dian Wang, Robin Walters, Robert Platt

    Abstract: Equivariant neural networks enforce symmetry within the structure of their convolutional layers, resulting in a substantial improvement in sample efficiency when learning an equivariant or invariant function. Such models are applicable to robotic manipulation learning which can often be formulated as a rotationally symmetric problem. This paper studies equivariant model architectures in the contex… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Published at ICLR 2022

  28. arXiv:2202.09468  [pdf, other

    cs.RO

    Sample Efficient Grasp Learning Using Equivariant Models

    Authors: Xupeng Zhu, Dian Wang, Ondrej Biza, Guanang Su, Robin Walters, Robert Platt

    Abstract: In planar grasp detection, the goal is to learn a function from an image of a scene onto a set of feasible grasp poses in $\mathrm{SE}(2)$. In this paper, we recognize that the optimal grasp function is $\mathrm{SE}(2)$-equivariant and can be modeled using an equivariant convolutional neural network. As a result, we are able to significantly improve the sample efficiency of grasp learning, obtaini… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

  29. arXiv:2202.09400  [pdf, other

    cs.RO cs.CV cs.LG

    Equivariant Transporter Network

    Authors: Haojie Huang, Dian Wang, Robin Walters, Robert Platt

    Abstract: Transporter Net is a recently proposed framework for pick and place that is able to learn good manipulation policies from a very few expert demonstrations. A key reason why Transporter Net is so sample efficient is that the model incorporates rotational equivariance into the pick module, i.e. the model immediately generalizes learned pick knowledge to objects presented in different orientations. T… ▽ More

    Submitted 20 September, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: Project Website: https://haojhuang.github.io/etp_page/

    Journal ref: RSS 2022

  30. arXiv:2202.05333  [pdf, other

    cs.RO cs.LG

    Factored World Models for Zero-Shot Generalization in Robotic Manipulation

    Authors: Ondrej Biza, Thomas Kipf, David Klee, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: World models for environments with many objects face a combinatorial explosion of states: as the number of objects increases, the number of possible arrangements grows exponentially. In this paper, we learn to generalize over robotic pick-and-place tasks using object-factored world models, which combat the combinatorial explosion by ensuring that predictions are equivariant to permutations of obje… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  31. arXiv:2201.07937  [pdf, other

    cs.CV

    GASCN: Graph Attention Shape Completion Network

    Authors: Haojie Huang, Ziyi Yang, Robert Platt

    Abstract: Shape completion, the problem of inferring the complete geometry of an object given a partial point cloud, is an important problem in robotics and computer vision. This paper proposes the Graph Attention Shape Completion Network (GASCN), a novel neural network model that solves this problem. This model combines a graph-based model for encoding local point cloud information with an MLP-based archit… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: International Conference on 3D Vision (3DV)

  32. arXiv:2110.15443  [pdf, other

    cs.RO

    Equivariant $Q$ Learning in Spatial Action Spaces

    Authors: Dian Wang, Robin Walters, Xupeng Zhu, Robert Platt

    Abstract: Recently, a variety of new equivariant neural network model architectures have been proposed that generalize better over rotational and reflectional symmetries than standard models. These models are relevant to robotics because many robotics problems can be expressed in a rotationally symmetric way. This paper focuses on equivariance over a visual state space and a spatial action space -- the sett… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted at Conference on Robot Learning (CoRL) 2021

  33. arXiv:2101.04178  [pdf, other

    cs.RO cs.LG

    Action Priors for Large Action Spaces in Robotics

    Authors: Ondrej Biza, Dian Wang, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: In robotics, it is often not possible to learn useful policies using pure model-free reinforcement learning without significant reward sha** or curriculum learning. As a consequence, many researchers rely on expert demonstrations to guide learning. However, acquiring expert demonstrations can be expensive. This paper proposes an alternative approach where the solutions of previously solved tasks… ▽ More

    Submitted 15 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 13 pages, 9 figures

    Journal ref: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '21). 2021. 205 - 213

  34. arXiv:2011.05559  [pdf, other

    cs.RO

    Learning Bayes Filter Models for Tactile Localization

    Authors: Tarik Kelestemur, Colin Keil, John P. Whitney, Robert Platt, Taskin Padir

    Abstract: Localizing and tracking the pose of robotic grippers are necessary skills for manipulation tasks. However, the manipulators with imprecise kinematic models (e.g. low-cost arms) or manipulators with unknown world coordinates (e.g. poor camera-arm calibration) cannot locate the gripper with respect to the world. In these circumstances, we can leverage tactile feedback between the gripper and the env… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted in IROS2020

  35. arXiv:2010.09170  [pdf, other

    cs.RO cs.AI cs.LG

    Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability

    Authors: Hai Nguyen, Brett Daley, Xinchao Song, Christopher Amato, Robert Platt

    Abstract: Many important robotics problems are partially observable in the sense that a single visual or force-feedback measurement is insufficient to reconstruct the state. Standard approaches involve learning a policy over beliefs or observation-action histories. However, both of these have drawbacks; it is expensive to track the belief online, and it is hard to learn policies directly over histories. We… ▽ More

    Submitted 20 October, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted at Conference on Robot Learning (CoRL) 2020

  36. Robotic Pick-and-Place With Uncertain Object Instance Segmentation and Shape Completion

    Authors: Marcus Gualtieri, Robert Platt

    Abstract: We consider robotic pick-and-place of partially visible, novel objects, where goal placements are non-trivial, e.g., tightly packed into a bin. One approach is (a) use object instance segmentation and shape completion to model the objects and (b) use a regrasp planner to decide grasps and places displacing the models to their goals. However, it is critical for the planner to account for uncertaint… ▽ More

    Submitted 3 March, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Supplementary material available for download: source code (https://github.com/mgualti/GeomPickPlace), supplemental results (https://github.com/mgualti/GeomPickPlace/raw/main/Notes/supplemental.pdf), and video (https://youtu.be/OBGf7L3iKsM)

    Journal ref: IEEE Robotics and Automation Letters (2021)

  37. arXiv:2010.02798  [pdf, other

    cs.RO

    Policy learning in SE(3) action spaces

    Authors: Dian Wang, Colin Kohler, Robert Platt

    Abstract: In the spatial action representation, the action space spans the space of target poses for robot motion commands, i.e. SE(2) or SE(3). This approach has been used to solve challenging robotic manipulation problems and shows promise. However, the method is often limited to a three dimensional action space and short horizon tasks. This paper proposes ASRSE3, a new method for handling higher dimensio… ▽ More

    Submitted 4 November, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 17 pages, accepted at CoRL 2020

  38. arXiv:2005.11810  [pdf, other

    cs.RO cs.AI cs.LG

    Learning visual servo policies via planner cloning

    Authors: Ulrich Viereck, Kate Saenko, Robert Platt

    Abstract: Learning control policies for visual servoing in novel environments is an important problem. However, standard model-free policy learning methods are slow. This paper explores planner cloning: using behavior cloning to learn policies that mimic the behavior of a full-state motion planner in simulation. We propose Penalized Q Cloning (PQC), a new behavior cloning algorithm. We show that it outperfo… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

  39. arXiv:2003.04300  [pdf, other

    cs.LG stat.ML

    Learning Discrete State Abstractions With Deep Variational Inference

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose an information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural encoder to map states onto continuous embeddings. We map these embeddings onto a discrete representation using an action-conditioned hidden Markov model… ▽ More

    Submitted 11 January, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 15 pages, 7 figures

  40. Learning Manipulation Skills Via Hierarchical Spatial Attention

    Authors: Marcus Gualtieri, Robert Platt

    Abstract: Learning generalizable skills in robotic manipulation has long been challenging due to real-world sized observation and action spaces. One method for addressing this problem is attention focus -- the robot learns where to attend its sensors and irrelevant details are ignored. However, these methods have largely not caught on due to the difficulty of learning a good attention policy and the added p… ▽ More

    Submitted 3 March, 2020; v1 submitted 19 April, 2019; originally announced April 2019.

    Comments: IEEE Transactions on Robotics, March 2020. Video: https://youtu.be/4dZ6WiDX3-s . Source code: https://github.com/mgualti/Seq6DofManip

  41. arXiv:1811.12929  [pdf, other

    cs.LG stat.ML

    Online Abstraction with MDP Homomorphisms for Deep Learning

    Authors: Ondrej Biza, Robert Platt

    Abstract: Abstraction of Markov Decision Processes is a useful tool for solving complex problems, as it can ignore unimportant aspects of an environment, simplifying the process of learning an optimal policy. In this paper, we propose a new algorithm for finding abstract MDPs in environments with continuous state spaces. It is based on MDP homomorphisms, a structure-preserving map** between MDPs. We demon… ▽ More

    Submitted 3 April, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '19). 2019. 1125 - 1133

  42. arXiv:1810.03400  [pdf, other

    cs.RO

    AutOTranS: an Autonomous Open World Transportation System

    Authors: Brayan S. Zapata-Impata, Vikrant Shah, Hanumant Singh, Robert Platt

    Abstract: Tasks in outdoor open world environments are now ripe for automation with mobile manipulators. The dynamic, unstructured and unknown environments associated with such tasks -- a prime example would be collecting roadside trash -- makes them particularly challenging. In this paper we present an approach to solving the problem of picking up, transporting, and drop** off novel objects outdoors. Our… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 7 pages, 10 figures, submitted to ICRA 2019

  43. arXiv:1809.09541  [pdf, other

    cs.RO

    Towards Assistive Robotic Pick and Place in Open World Environments

    Authors: Dian Wang, Colin Kohler, Andreas ten Pas, Alexander Wilkinson, Maozhi Liu, Holly Yanco, Robert Platt

    Abstract: Assistive robot manipulators must be able to autonomously pick and place a wide range of novel objects to be truly useful. However, current assistive robots lack this capability. Additionally, assistive systems need to have an interface that is easy to learn, to use, and to understand. This paper takes a step forward in this direction. We present a robot system comprised of a robotic arm and a mob… ▽ More

    Submitted 8 July, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: 16 pages, 14 figures, submitted to The International Symposium on Robotics Research (ISRR) 2019

  44. arXiv:1807.10413  [pdf, other

    cs.RO cs.AI cs.CV

    Adapting control policies from simulation to reality using a pairwise loss

    Authors: Ulrich Viereck, Xingchao Peng, Kate Saenko, Robert Platt

    Abstract: This paper proposes an approach to domain transfer based on a pairwise loss function that helps transfer control policies learned in simulation onto a real robot. We explore the idea in the context of a 'category level' manipulation task where a control policy is learned that enables a robot to perform a mating task involving novel objects. We explore the case where depth images are used as the ma… ▽ More

    Submitted 26 October, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

  45. arXiv:1806.10045  [pdf, other

    cs.RO

    Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies

    Authors: Robert Platt, Colin Kohler, Marcus Gualtieri

    Abstract: In applications of deep reinforcement learning to robotics, it is often the case that we want to learn pose invariant policies: policies that are invariant to changes in the position and orientation of objects in the world. For example, consider a peg-in-hole insertion task. If the agent learns to insert a peg into one hole, we would like that policy to generalize to holes presented in different p… ▽ More

    Submitted 17 September, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

  46. arXiv:1806.06134  [pdf, other

    cs.RO

    Learning 6-DoF Gras** and Pick-Place Using Attention Focus

    Authors: Marcus Gualtieri, Robert Platt

    Abstract: We address a class of manipulation problems where the robot perceives the scene with a depth sensor and can move its end effector in a space with six degrees of freedom -- 3D position and orientation. Our approach is to formulate the problem as a Markov decision process (MDP) with abstract yet generally applicable state and action representations. Finding a good solution to the MDP requires adding… ▽ More

    Submitted 27 September, 2018; v1 submitted 15 June, 2018; originally announced June 2018.

  47. arXiv:1805.08180   

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Hierarchical Reinforcement Learning with Hindsight

    Authors: Andrew Levy, Robert Platt, Kate Saenko

    Abstract: Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a sample efficient and automated fashion. Our approach combines universal value functions and hindsight learning, allowing agents to learn policies belonging to differ… ▽ More

    Submitted 8 March, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Duplicate. See arXiv:1712.00948 "Learning Multi-Level Hierarchies with Hindsight" for latest version

  48. arXiv:1712.00948  [pdf, other

    cs.AI cs.LG cs.NE cs.RO

    Learning Multi-Level Hierarchies with Hindsight

    Authors: Andrew Levy, George Konidaris, Robert Platt, Kate Saenko

    Abstract: Hierarchical agents have the potential to solve sequential decision making tasks with greater sample efficiency than their non-hierarchical counterparts because hierarchical agents can break down tasks into sets of subtasks that only require short sequences of decisions. In order to realize this potential of faster learning, hierarchical agents need to be able to learn their multiple levels of pol… ▽ More

    Submitted 3 September, 2019; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: ICLR 2019 Accepted Paper

  49. arXiv:1707.05615  [pdf, other

    cs.RO

    Pick and Place Without Geometric Object Models

    Authors: Marcus Gualtieri, Andreas ten Pas, Robert Platt

    Abstract: We propose a novel formulation of robotic pick and place as a deep reinforcement learning (RL) problem. Whereas most deep RL approaches to robotic manipulation frame the problem in terms of low level states and actions, we propose a more abstract formulation. In this formulation, actions are target reach poses for the hand and states are a history of such reaches. We show this approach can solve a… ▽ More

    Submitted 22 February, 2018; v1 submitted 18 July, 2017; originally announced July 2017.

  50. arXiv:1706.09911  [pdf, other

    cs.RO

    Grasp Pose Detection in Point Clouds

    Authors: Andreas ten Pas, Marcus Gualtieri, Kate Saenko, Robert Platt

    Abstract: Recently, a number of grasp detection methods have been proposed that can be used to localize robotic grasp configurations directly from sensor data without estimating object pose. The underlying idea is to treat grasp perception analogously to object detection in computer vision. These methods take as input a noisy and partially occluded RGBD image or point cloud and produce as output pose estima… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: arXiv admin note: text overlap with arXiv:1603.01564