Skip to main content

Showing 1–7 of 7 results for author: Suh, H J T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.14079  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

    Authors: H. J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake

    Abstract: Gradient-based methods enable efficient search capabilities in high dimensions. However, in order to apply them effectively in offline optimization paradigms such as offline Reinforcement Learning (RL) or Imitation Learning (IL), we require a more careful consideration of how uncertainty estimation interplays with first-order methods that attempt to minimize them. We study smoothed distance to dat… ▽ More

    Submitted 16 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: Glen Chou, Hongkai Dai, and Lujie Yang contributed equally to this work. Accepted to CoRL 2023

  2. arXiv:2206.10787  [pdf, other

    cs.RO

    Global Planning for Contact-Rich Manipulation via Local Smoothing of Quasi-dynamic Contact Models

    Authors: Tao Pang, H. J. Terry Suh, Lujie Yang, Russ Tedrake

    Abstract: The empirical success of Reinforcement Learning (RL) in the setting of contact-rich manipulation leaves much to be understood from a model-based perspective, where the key difficulties are often attributed to (i) the explosion of contact modes, (ii) stiff, non-smooth contact dynamics and the resulting exploding / discontinuous gradients, and (iii) the non-convexity of the planning problem. The sto… ▽ More

    Submitted 27 February, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: The first two authors contributed equally to this work

  3. arXiv:2202.00817  [pdf, other

    cs.LG cs.AI cs.RO

    Do Differentiable Simulators Give Better Policy Gradients?

    Authors: H. J. Terry Suh, Max Simchowitz, Kaiqing Zhang, Russ Tedrake

    Abstract: Differentiable simulators promise faster computation time for reinforcement learning by replacing zeroth-order gradient estimates of a stochastic objective with an estimate based on first-order gradients. However, it is yet unclear what factors decide the performance of the two estimators on complex landscapes that involve long-horizon planning and control on physical systems, despite the crucial… ▽ More

    Submitted 22 August, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Accepted to ICML 2022

    Journal ref: ICML 2022

  4. arXiv:2111.01376  [pdf, other

    cs.RO

    SEED: Series Elastic End Effectors in 6D for Visuotactile Tool Use

    Authors: H. J. Terry Suh, Naveen Kuppuswamy, Tao Pang, Paul Mitiguy, Alex Alspach, Russ Tedrake

    Abstract: We propose the framework of Series Elastic End Effectors in 6D (SEED), which combines a spatially compliant element with visuotactile sensing to grasp and manipulate tools in the wild. Our framework generalizes the benefits of series elasticity to 6-dof, while providing an abstraction of control using visuotactile sensing. We propose an algorithm for relative pose estimation from visuotactile sens… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Submitted to Robosoft 2022

  5. arXiv:2109.05143  [pdf, other

    cs.RO

    Bundled Gradients through Contact via Randomized Smoothing

    Authors: H. J. Terry Suh, Tao Pang, Russ Tedrake

    Abstract: The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from… ▽ More

    Submitted 21 January, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: The first two authors contributed equally. Accepted to Robotics and Automation Letters (RA-L)

  6. arXiv:2002.09093  [pdf, other

    cs.RO

    The Surprising Effectiveness of Linear Models for Visual Foresight in Object Pile Manipulation

    Authors: H. J. Terry Suh, Russ Tedrake

    Abstract: In this paper, we tackle the problem of pushing piles of small objects into a desired target set using visual feedback. Unlike conventional single-object manipulation pipelines, which estimate the state of the system parametrized by pose, the underlying physical state of this system is difficult to observe from images. Thus, we take the approach of reasoning directly in the space of images, and ac… ▽ More

    Submitted 15 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted to Workshop on Algorithmic Foundations of Robotics (WAFR) 2020, Video link: https://www.youtube.com/watch?v=HfFSnsnR590

  7. arXiv:1909.10209  [pdf, other

    cs.RO

    Energy-Efficient Motion Planning for Multi-Modal Hybrid Locomotion

    Authors: H. J. Terry Suh, Xiaobin Xiong, Andrew Singletary, Aaron D. Ames, Joel W. Burdick

    Abstract: Hybrid locomotion, which combines multiple modalities of locomotion within a single robot, enables robots to carry out complex tasks in diverse environments. This paper presents a novel method for planning multi-modal locomotion trajectories using approximate dynamic programming. We formulate this problem as a shortest-path search through a state-space graph, where the edge cost is assigned as opt… ▽ More

    Submitted 4 August, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: Accepted to International Conference on Intelligent Robots and Systems (IROS) 2020