Showing 1–30 of 30 results for author: Seita, D

  1. arXiv:2407.04152  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation

    Authors: I-Chun Arthur Liu, Sicheng He, Daniel Seita, Gaurav Sukhatme

    Abstract: Bimanual manipulation is critical to many robotics applications. In contrast to single-arm manipulation, bimanual manipulation tasks are challenging due to higher-dimensional action spaces. Prior works leverage large amounts of data and primitive actions to address this problem, but may suffer from sample inefficiency and limited generalization across various tasks. To this end, we propose VoxAct-…

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2407.01898  [pdf, other]

    cs.RO

    Learning Granular Media Avalanche Behavior for Indirectly Manipulating Obstacles on a Granular Slope

    Authors: Haodi Hu, Feifei Qian, Daniel Seita

    Abstract: Legged robot locomotion on sand slopes is challenging due to the complex dynamics of granular media and how the lack of solid surfaces can hinder locomotion. A promising strategy, inspired by ghost crabs and other organisms in nature, is to strategically interact with rocks, debris, and other obstacles to facilitate movement. To provide legged robots with this ability, we present a novel approach…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Submitted to CoRL 2024

  3. arXiv:2406.09640  [pdf, other]

    cs.RO

    GPT-Fabric: Folding and Smoothing Fabric by Leveraging Pre-Trained Foundation Models

    Authors: Vedant Raval, Enyu Zhao, Hejia Zhang, Stefanos Nikolaidis, Daniel Seita

    Abstract: Fabric manipulation has applications in folding blankets, handling patient clothing, and protecting items with covers. It is challenging for robots to perform fabric manipulation since fabrics have infinite-dimensional configuration spaces, complex dynamics, and may be in folded or crumpled configurations with severe self-occlusions. Prior work on robotic fabric manipulation relies either on heavi…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Code, prompts, and videos are available at https://tinyurl.com/gptfab

  4. arXiv:2405.09581  [pdf, other]

    cs.RO

    Self-Supervised Learning of Dynamic Planar Manipulation of Free-End Cables

    Authors: Jonathan Wang, Huang Huang, Vincent Lim, Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Yunliang Chen, Ken Goldberg

    Abstract: Dynamic manipulation of free-end cables has applications for cable management in homes, warehouses and manufacturing plants. We present a supervised learning approach for dynamic manipulation of free-end cables, focusing on the problem of getting the cable endpoint to a designated target position, which may lie outside the reachable workspace of the robot end effector. We present a simulator, tune…

    Submitted 28 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  5. arXiv:2403.16188  [pdf, other]

    cs.CV

    Cross-domain Multi-modal Few-shot Object Detection via Rich Text

    Authors: Zeyu Shangguan, Daniel Seita, Mohammad Rostami

    Abstract: Cross-modal feature extraction and integration have led to steady performance improvements in few-shot learning tasks due to generating richer features. However, existing multi-modal object detection (MM-OD) methods degrade when facing significant domain-shift and are sample insufficient. We hypothesize that rich text information could more effectively help the model to build a knowledge relations…

    Submitted 24 March, 2024; originally announced March 2024.

  6. arXiv:2303.16898  [pdf, other]

    cs.RO

    Bagging by Learning to Singulate Layers Using Interactive Perception

    Authors: Lawrence Yunliang Chen, Baiyu Shi, Roy Lin, Daniel Seita, Ayah Ahmad, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

    Abstract: Many fabric handling and 2D deformable material tasks in homes and industry require singulating layers of material such as opening a bag or arranging garments for sewing. In contrast to methods requiring specialized sensing or end effectors, we use only visual observations with ordinary parallel jaw grippers. We propose SLIP: Singulating Layers using Interactive Perception, and apply SLIP to the t…

    Submitted 1 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: IROS 2023

  7. arXiv:2211.09006  [pdf, other]

    cs.RO

    ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow from Point Clouds

    Authors: Daniel Seita, Yufei Wang, Sarthak J. Shetty, Edward Yao Li, Zackory Erickson, David Held

    Abstract: Point clouds are a widely available and canonical data modality which convey the 3D geometry of a scene. Despite significant progress in classification and segmentation from point clouds, policy learning from such a modality remains challenging, and most prior works in imitation learning focus on learning policies from images or state information. In this paper, we propose a novel framework for le…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Conference on Robot Learning (CoRL), 2022. Supplementary material is available at https://sites.google.com/view/point-cloud-policy/home

  8. arXiv:2210.17217  [pdf, other]

    cs.RO

    AutoBag: Learning to Open Plastic Bags and Insert Objects

    Authors: Lawrence Yunliang Chen, Baiyu Shi, Daniel Seita, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

    Abstract: Thin plastic bags are ubiquitous in retail stores, healthcare, food handling, recycling, homes, and school lunchrooms. They are challenging both for perception (due to specularities and occlusions) and for manipulation (due to the dynamics of their 3D deformable structure). We formulate the task of "bagging:" manipulating common plastic shopping bags with two handles from an unstructured initial s…

    Submitted 19 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: ICRA 2023

  9. arXiv:2207.11196  [pdf, other]

    cs.RO

    Learning to Singulate Layers of Cloth using Tactile Feedback

    Authors: Sashank Tirumala, Thomas Weng, Daniel Seita, Oliver Kroemer, Zeynep Temel, David Held

    Abstract: Robotic manipulation of cloth has applications ranging from fabrics manufacturing to handling blankets and laundry. Cloth manipulation is challenging for robots largely due to their high degrees of freedom, complex dynamics, and severe self-occlusions when in folded or crumpled configurations. Prior work on robotic manipulation of cloth relies primarily on vision sensors alone, which may pose chal…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: IROS 2022. See https://sites.google.com/view/reskin-cloth for supplementary material

  10. arXiv:2206.08921  [pdf, other]

    cs.RO

    Efficiently Learning Single-Arm Fling Motions to Smooth Garments

    Authors: Lawrence Yunliang Chen, Huang Huang, Ellen Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, Thomas Kollar, Ken Goldberg

    Abstract: Recent work has shown that 2-arm "fling" motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment g…

    Submitted 24 September, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to 2022 International Symposium on Robotics Research (ISRR)

  11. arXiv:2111.04814  [pdf, other]

    cs.RO

    Planar Robot Casting with Real2Sim2Real Self-Supervised Learning

    Authors: Vincent Lim, Huang Huang, Lawrence Yunliang Chen, Jonathan Wang, Jeffrey Ichnowski, Daniel Seita, Michael Laskey, Ken Goldberg

    Abstract: This paper introduces the task of Planar Robot Casting (PRC): where one planar motion of a robot arm holding one end of a cable causes the other end to slide across the plane toward a desired target. PRC allows the cable to reach points beyond the robot workspace and has applications for cable management in homes, warehouses, and factories. To efficiently learn a PRC policy for a given cable…

    Submitted 25 June, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  12. arXiv:2109.07380  [pdf, other]

    cs.LG cs.RO

    DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning

    Authors: Daniel Seita, Abhinav Gopal, Zhao Mandi, John Canny

    Abstract: Deep reinforcement learning (RL) has shown great empirical successes, but suffers from brittleness and sample inefficiency. A potential remedy is to use a previously-trained policy as a source of supervision. In this work, we refer to these policies as teachers and study how to transfer their expertise to new student policies by focusing on data usage. We propose a framework, Data CUrriculum for R…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Supplementary material is available at https://tinyurl.com/teach-dcur

  13. arXiv:2104.00053  [pdf, other]

    cs.RO cs.AI

    LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Carl Putterman, Michael Luo, Daniel S. Brown, Daniel Seita, Brijen Thananjeyan, Ellen Novoseller, Ken Goldberg

    Abstract: Corrective interventions while a robot is learning to automate a task provide an intuitive method for a human supervisor to assist the robot and convey information about desired behavior. However, these interventions can impose significant burden on a human supervisor, as each intervention interrupts other work the human is doing, incurs latency with each context switch between supervisor and auto…

    Submitted 20 July, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: IEEE CASE 2021

  14. arXiv:2102.09754  [pdf, other]

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Physical Sequential Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We build upon the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different sequential fabric manipu…

    Submitted 20 July, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: Journal extension of prior work on VSF to appear in Autonomous Robots S.I. 207. arXiv admin note: text overlap with arXiv:2003.09044

  15. Automating Surgical Peg Transfer: Calibration with Deep Learning Can Exceed Speed, Accuracy, and Consistency of Humans

    Authors: Minho Hwang, Jeffrey Ichnowski, Brijen Thananjeyan, Daniel Seita, Samuel Paradis, Danyal Fer, Thomas Low, Ken Goldberg

    Abstract: Peg transfer is a well-known surgical training task in the Fundamentals of Laparoscopic Surgery (FLS). While human surgeons teleoperate robots such as the da Vinci to perform this task with high speed and accuracy, it is challenging to automate. This paper presents a novel system and control method using a da Vinci Research Kit (dVRK) surgical robot and a Zivid depth sensor, and a human subjects…

    Submitted 15 May, 2022; v1 submitted 23 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Automation Science and Engineering (2022)

  16. arXiv:2012.03385  [pdf, other]

    cs.RO cs.LG

    Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks

    Authors: Daniel Seita, Pete Florence, Jonathan Tompson, Erwin Coumans, Vikas Sindhwani, Ken Goldberg, Andy Zeng

    Abstract: Rearranging and manipulating deformable objects such as cables, fabrics, and bags is a long-standing challenge in robotic manipulation. The complex dynamics and high-dimensional configuration spaces of deformables, compared to rigid objects, make manipulation difficult not only for multi-step planning, but even for goal specification. Goals cannot be as easily specified as rigid object poses, and…

    Submitted 18 June, 2023; v1 submitted 6 December, 2020; originally announced December 2020.

    Comments: See https://berkeleyautomation.github.io/bags/ for project website and code; v3 is ICRA 2021 version and v4 adds physical experiments and improves simulation results

  17. arXiv:2011.06163  [pdf, other]

    cs.RO

    Intermittent Visual Servoing: Efficiently Learning Policies Robust to Instrument Changes for High-precision Surgical Manipulation

    Authors: Samuel Paradis, Minho Hwang, Brijen Thananjeyan, Jeffrey Ichnowski, Daniel Seita, Danyal Fer, Thomas Low, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Automation of surgical tasks using cable-driven robots is challenging due to backlash, hysteresis, and cable tension, and these issues are exacerbated as surgical instruments must often be changed during an operation. In this work, we propose a framework for automation of high-precision surgical tasks by learning sample efficient, accurate, closed-loop policies that operate directly on visual feed…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 6 pages, 5 figures, 4 tables, submitted to ICRA 2021, supplementary material at https://tinyurl.com/ivs-icra

  18. arXiv:2011.04840  [pdf, other]

    cs.RO cs.AI

    Robots of the Lost Arc: Self-Supervised Learning to Dynamically Manipulate Fixed-Endpoint Cables

    Authors: Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Jonathan Wang, Huang Huang, Ken Goldberg

    Abstract: We explore how high-speed robot arm motions can dynamically manipulate cables to vault over obstacles, knock objects from pedestals, and weave between obstacles. In this paper, we propose a self-supervised learning framework that enables a UR5 robot to perform these three tasks. The framework finds a 3D apex point for the robot arm, which, together with a task-specific trajectory function, defines…

    Submitted 1 May, 2024; v1 submitted 9 November, 2020; originally announced November 2020.

  19. arXiv:2010.04339  [pdf, other]

    cs.CV cs.RO

    MMGSD: Multi-Modal Gaussian Shape Descriptors for Correspondence Matching in 1D and 2D Deformable Objects

    Authors: Aditya Ganapathi, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Daniel Seita, Ryan Hoque, Joseph E. Gonzalez, Ken Goldberg

    Abstract: We explore learning pixelwise correspondences between images of deformable objects in different configurations. Traditional correspondence matching approaches such as SIFT, SURF, and ORB can fail to provide sufficient contextual information for fine-grained manipulation. We propose Multi-Modal Gaussian Shape Descriptor (MMGSD), a new visual representation of deformable objects which extends ideas…

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: IROS 2020 Workshop on Managing Deformation: A Step Towards Higher Robot Autonomy

  20. arXiv:2003.12698  [pdf, other]

    cs.RO cs.CV cs.LG

    Learning Dense Visual Correspondences in Simulation to Smooth and Fold Real Fabrics

    Authors: Aditya Ganapathi, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Daniel Seita, Jennifer Grannen, Minho Hwang, Ryan Hoque, Joseph E. Gonzalez, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation is challenging due to the infinite dimensional configuration space, self-occlusion, and complex dynamics of fabrics. There has been significant prior work on learning policies for specific deformable manipulation tasks, but comparatively less focus on algorithms which can efficiently learn many different tasks. In this paper, we learn visual correspondences for deformab…

    Submitted 11 November, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

  21. arXiv:2003.09044  [pdf, other]

    cs.RO cs.AI cs.CV

    VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation

    Authors: Ryan Hoque, Daniel Seita, Ashwin Balakrishna, Aditya Ganapathi, Ajay Kumar Tanwani, Nawid Jamali, Katsu Yamane, Soshi Iba, Ken Goldberg

    Abstract: Robotic fabric manipulation has applications in home robotics, textiles, senior care and surgery. Existing fabric manipulation techniques, however, are designed for specific tasks, making it difficult to generalize across different but related tasks. We extend the Visual Foresight framework to learn fabric dynamics that can be efficiently reused to accomplish different fabric manipulation tasks wi…

    Submitted 18 February, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Robotics: Science and Systems (RSS) 2020

  22. Efficiently Calibrating Cable-Driven Surgical Robots with RGBD Fiducial Sensing and Recurrent Neural Networks

    Authors: Minho Hwang, Brijen Thananjeyan, Samuel Paradis, Daniel Seita, Jeffrey Ichnowski, Danyal Fer, Thomas Low, Ken Goldberg

    Abstract: Automation of surgical subtasks using cable-driven robotic surgical assistants (RSAs) such as Intuitive Surgical's da Vinci Research Kit (dVRK) is challenging due to imprecision in control from cable-related effects such as cable stretching and hysteresis. We propose a novel approach to efficiently calibrate such robots by placing 3D printed fiducial coordinate frames on the arm and end-effector…

    Submitted 31 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: 8 pages, 11 figures, 3 tables

    Journal ref: IEEE Robotics and Automation Letters, 5 (2020) 5937-5944

  23. arXiv:2002.06302  [pdf, other]

    cs.RO

    Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot

    Authors: Minho Hwang, Daniel Seita, Brijen Thananjeyan, Jeffrey Ichnowski, Samuel Paradis, Danyal Fer, Thomas Low, Ken Goldberg

    Abstract: Recent advances in depth-sensing have significantly increased accuracy, resolution, and frame rate, as shown in the 1920x1200 resolution and 13 frames per second Zivid RGBD camera. In this study, we explore the potential of depth sensing for efficient and reliable automation of surgical subtasks. We consider a monochrome (all red) version of the peg transfer task from the Fundamentals of Laparosco…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Camera-ready version for the International Symposium on Medical Robotics (ISMR) 2020

  24. arXiv:1910.12154  [pdf, other]

    cs.LG cs.AI

    ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations

    Authors: Daniel Seita, David Chan, Roshan Rao, Chen Tang, Mandi Zhao, John Canny

    Abstract: Learning from demonstrations is a popular tool for accelerating and reducing the exploration requirements of reinforcement learning. When providing expert demonstrations to human students, we know that the demonstrations must fall within a particular range of difficulties called the "Zone of Proximal Development (ZPD)". If they are too easy the student learns nothing, but if they are too difficult…

    Submitted 26 October, 2019; originally announced October 2019.

    Comments: Deep Reinforcement Learning Workshop at NeurIPS 2019

  25. arXiv:1910.04854  [pdf, other]

    cs.RO cs.AI cs.CV

    Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor

    Authors: Daniel Seita, Aditya Ganapathi, Ryan Hoque, Minho Hwang, Edward Cen, Ajay Kumar Tanwani, Ashwin Balakrishna, Brijen Thananjeyan, Jeffrey Ichnowski, Nawid Jamali, Katsu Yamane, Soshi Iba, John Canny, Ken Goldberg

    Abstract: Sequential pulling policies to flatten and smooth fabrics have applications from surgery to manufacturing to home tasks such as bed making and folding clothes. Due to the complexity of fabric states and dynamics, we apply deep imitation learning to learn policies that, given color (RGB), depth (D), or combined color-depth (RGBD) images of a rectangular fabric sample, estimate pick points and pull…

    Submitted 2 March, 2020; v1 submitted 23 September, 2019; originally announced October 2019.

    Comments: Supplementary material is available at https://sites.google.com/view/fabric-smoothing ; Version 2 has significant improvements with new results and figures

  26. arXiv:1904.00511  [pdf, other]

    cs.LG cs.AI cs.RO

    Risk Averse Robust Adversarial Reinforcement Learning

    Authors: Xinlei Pan, Daniel Seita, Yang Gao, John Canny

    Abstract: Deep reinforcement learning has recently made significant progress in solving computer games and robotic control tasks. A known problem, though, is that policies overfit to the training environment and may not avoid rare, catastrophic events such as automotive accidents. A classical technique for improving the robustness of reinforcement learning algorithms is to train on a set of randomized envir…

    Submitted 31 March, 2019; originally announced April 2019.

    Comments: ICRA 2019

  27. arXiv:1809.09810  [pdf, other]

    cs.RO cs.AI

    Deep Transfer Learning of Pick Points on Fabric for Robot Bed-Making

    Authors: Daniel Seita, Nawid Jamali, Michael Laskey, Ajay Kumar Tanwani, Ron Berenstein, Prakash Baskaran, Soshi Iba, John Canny, Ken Goldberg

    Abstract: A fundamental challenge in manipulating fabric for clothes folding and textiles manufacturing is computing "pick points" to effectively modify the state of an uncertain manifold. We present a supervised deep transfer learning approach to locate pick points using depth images for invariance to color and texture. We consider the task of bed-making, where a robot sequentially grasps and pulls at pick…

    Submitted 16 September, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: International Symposium on Robotics Research (ISRR) 2019. Expanded and revised version of arXiv:1711.02525 as well as earlier versions here under the title "Robot Bed-Making: Deep Transfer Learning Using Depth Sensing of Deformable Fabric". Project website at https://sites.google.com/view/bed-make

  28. arXiv:1709.06668  [pdf, other]

    cs.RO

    Fast and Reliable Autonomous Surgical Debridement with Cable-Driven Robots Using a Two-Phase Calibration Procedure

    Authors: Daniel Seita, Sanjay Krishnan, Roy Fox, Stephen McKinley, John Canny, Ken Goldberg

    Abstract: Automating precision subtasks such as debridement (removing dead or diseased tissue fragments) with Robotic Surgical Assistants (RSAs) such as the da Vinci Research Kit (dVRK) is challenging due to inherent non-linearities in cable-driven systems. We propose and evaluate a novel two-phase coarse-to-fine calibration method. In Phase I (coarse), we place a red calibration marker on the end effector…

    Submitted 24 February, 2018; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: Code, data, and videos are available at https://sites.google.com/view/calib-icra/. Final version for ICRA 2018

  29. arXiv:1610.06848  [pdf, other]

    cs.LG stat.ML

    An Efficient Minibatch Acceptance Test for Metropolis-Hastings

    Authors: Daniel Seita, Xinlei Pan, Haoyu Chen, John Canny

    Abstract: We present a novel Metropolis-Hastings method for large datasets that uses small expected-size minibatches of data. Previous work on reducing the cost of Metropolis-Hastings tests yields variable data consumed per sample, with only constant factor reductions versus using the full dataset for each sample. Here we present a method that can be tuned to provide arbitrarily small batch sizes, by adjusti…

    Submitted 9 July, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: Final version for UAI 2017

  30. arXiv:1511.06416  [pdf, other]

    cs.LG stat.ML

    Fast Parallel SAME Gibbs Sampling on General Discrete Bayesian Networks

    Authors: Daniel Seita, Haoyu Chen, John Canny

    Abstract: A fundamental task in machine learning and related fields is to perform inference on Bayesian networks. Since exact inference takes exponential time in general, a variety of approximate methods are used. Gibbs sampling is one of the most accurate approaches and provides unbiased samples from the posterior but it has historically been too expensive for large models. In this paper, we present an opt…

    Submitted 19 November, 2015; originally announced November 2015.