
Showing 1–37 of 37 results for author: Ratliff, N

  1. arXiv:2407.02274  [pdf, other]

    cs.RO

    DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics

    Authors: Tyler Ga Wei Lum, Martin Matak, Viktor Makoviychuk, Ankur Handa, Arthur Allshire, Tucker Hermans, Nathan D. Ratliff, Karl Van Wyk

    Abstract: A pivotal challenge in robotics is achieving fast, safe, and robust dexterous grasping across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous grasping policy trained entir…

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2405.02250  [pdf, other]

    cs.RO

    Geometric Fabrics: a Safe Guiding Medium for Policy Learning

    Authors: Karl Van Wyk, Ankur Handa, Viktor Makoviychuk, Yijie Guo, Arthur Allshire, Nathan D. Ratliff

    Abstract: Robotics policies are always subjected to complex, second order dynamics that entangle their actions with resulting states. In reinforcement learning (RL) contexts, policies have the burden of deciphering these complicated interactions over massive amounts of experience and complex reward functions to learn how to accomplish tasks. Moreover, policies typically issue actions directly to controllers…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2310.17274  [pdf, other]

    cs.RO cs.AR cs.DC

    cuRobo: Parallelized Collision-Free Minimum-Jerk Robot Motion Generation

    Authors: Balakumar Sundaralingam, Siva Kumar Sastry Hari, Adam Fishman, Caelan Garrett, Karl Van Wyk, Valts Blukis, Alexander Millane, Helen Oleynikova, Ankur Handa, Fabio Ramos, Nathan Ratliff, Dieter Fox

    Abstract: This paper explores the problem of collision-free motion generation for manipulators by formulating it as a global motion optimization problem. We develop a parallel optimization technique to solve this problem and demonstrate its effectiveness on massively parallel GPUs. We show that combining simple optimization techniques with many parallel seeds leads to solving difficult motion generation pro…

    Submitted 3 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: revised technical report, 62 pages, Website: https://curobo.org

  4. arXiv:2309.07368  [pdf, ps, other]

    cs.RO

    Fabrics: A Foundationally Stable Medium for Encoding Prior Experience

    Authors: Nathan Ratliff, Karl Van Wyk

    Abstract: Most dynamics functions are not well-aligned to task requirements. Controllers, therefore, often invert the dynamics and reshape it into something more useful. The learning community has found that these controllers, such as Operational Space Control (OSC), can offer important inductive biases for training. However, OSC only captures straight line end-effector motion. There's a lot more behavior w…

    Submitted 13 September, 2023; originally announced September 2023.

  5. arXiv:2109.10443  [pdf, other]

    cs.RO eess.SY

    Geometric Fabrics: Generalizing Classical Mechanics to Capture the Physics of Behavior

    Authors: Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana, Buck Babich, Bryan Peele, Qian Wan, Iretiayo Akinola, Balakumar Sundaralingam, Dieter Fox, Byron Boots, Nathan D. Ratliff

    Abstract: Classical mechanical systems are central to controller design in energy shaping methods of geometric control. However, their expressivity is limited by position-only metrics and the intimate link between metric and geometry. Recent work on Riemannian Motion Policies (RMPs) has shown that shedding these restrictions results in powerful design tools, but at the expense of theoretical stability guara…

    Submitted 18 January, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  6. arXiv:2105.03019  [pdf, other]

    cs.RO cs.LG

    Imitation Learning via Simultaneous Optimization of Policies and Auxiliary Trajectories

    Authors: Mandy Xie, Anqi Li, Karl Van Wyk, Frank Dellaert, Byron Boots, Nathan Ratliff

    Abstract: Imitation learning (IL) is a frequently used approach for data-efficient policy learning. Many IL methods, such as Dataset Aggregation (DAgger), combat challenges like distributional shift by interacting with oracular experts. Unfortunately, assuming access to oracular experts is often unrealistic in practice; data used in IL frequently comes from offline processes such as lead-through or teleoper…

    Submitted 5 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

  7. arXiv:2104.13542  [pdf, other]

    cs.RO

    STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation

    Authors: Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots

    Abstract: Sampling-based model-predictive control (MPC) is a promising tool for feedback control of robots with complex, non-smooth dynamics, and cost functions. However, the computationally demanding nature of sampling-based MPC algorithms has been a key bottleneck in their application to high-dimensional robotic manipulation problems in the real world. Previous methods have addressed this issue by running…

    Submitted 14 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted for oral presentation at the Conference on Robot Learning (CoRL), 2021. Code available at: https://github.com/NVlabs/storm

    Journal ref: 5th Annual Conference on Robot Learning, 2021

  8. arXiv:2103.05922  [pdf, other]

    cs.RO cs.LG eess.SY

    RMP2: A Structured Composable Policy Class for Robot Learning

    Authors: Anqi Li, Ching-An Cheng, M. Asif Rana, Man Xie, Karl Van Wyk, Nathan Ratliff, Byron Boots

    Abstract: We consider the problem of learning motion policies for acceleration-based robotics systems with a structured policy class specified by RMPflow. RMPflow is a multi-task control framework that has been successfully applied in many robotics problems. Using RMPflow as a structured policy class in learning has several benefits, such as sufficient expressiveness, the flexibility to inject different lev…

    Submitted 10 March, 2021; originally announced March 2021.

  9. arXiv:2012.13457  [pdf, other]

    cs.RO cs.LG

    Towards Coordinated Robot Motions: End-to-End Learning of Motion Policies on Transform Trees

    Authors: M. Asif Rana, Anqi Li, Dieter Fox, Sonia Chernova, Byron Boots, Nathan Ratliff

    Abstract: Generating robot motion that fulfills multiple tasks simultaneously is challenging due to the geometric constraints imposed by the robot. In this paper, we propose to solve multi-task problems through learning structured policies from human demonstrations. Our structured policy is inspired by RMPflow, a framework for combining subtask policies on different spaces. The policy structure provides the…

    Submitted 10 March, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

  10. arXiv:2010.15676  [pdf, other]

    cs.RO math.OC

    Optimization Fabrics for Behavioral Design

    Authors: Nathan D. Ratliff, Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana

    Abstract: A common approach to the provably stable design of reactive behavior, exemplified by operational space control, is to reduce the problem to the design of virtual classical mechanical systems (energy shaping). This framework is widely used, and through it we gain stability, but at the price of expressivity. This work presents a comprehensive theoretical framework expanding this approach showing tha…

    Submitted 25 June, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2008.02399

  11. arXiv:2010.14750  [pdf, other]

    cs.RO

    Geometric Fabrics for the Acceleration-based Design of Robotic Motion

    Authors: Mandy Xie, Karl Van Wyk, Anqi Li, Muhammad Asif Rana, Qian Wan, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: This paper describes the pragmatic design and construction of geometric fabrics for shaping a robot's task-independent nominal behavior, capturing behavioral components such as obstacle avoidance, joint limit avoidance, redundancy resolution, global navigation heuristics, etc. Geometric fabrics constitute the most concrete incarnation of a new mathematical formulation for reactive behavior called…

    Submitted 25 June, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  12. arXiv:2010.14745  [pdf, other]

    cs.RO

    Generalized Nonlinear and Finsler Geometry for Robotics

    Authors: Nathan D. Ratliff, Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana

    Abstract: Robotics research has found numerous important applications of Riemannian geometry. Despite that, the concepts remain challenging to many roboticists because the background material is complex and strikingly foreign. Beyond Riemannian geometry, there are many natural generalizations in the mathematical literature -- areas such as Finsler geometry and spray geometry -- but those generalization…

    Submitted 2 July, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  13. arXiv:2008.02399  [pdf, other]

    cs.RO math.OC

    Optimization Fabrics

    Authors: Nathan D. Ratliff, Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana

    Abstract: This paper presents a theory of optimization fabrics, second-order differential equations that encode nominal behaviors on a space and can be used to define the behavior of a smooth optimizer. Optimization fabrics can encode commonalities among optimization problems that reflect the structure of the space itself, enabling smooth optimization processes to intelligently navigate each problem even wh…

    Submitted 21 August, 2020; v1 submitted 5 August, 2020; originally announced August 2020.

  14. arXiv:2007.14256  [pdf, other]

    cs.RO

    RMPflow: A Geometric Framework for Generation of Multi-Task Motion Policies

    Authors: Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: Generating robot motion for multiple tasks in dynamic environments is challenging, requiring an algorithm to respond reactively while accounting for complex nonlinear relationships between tasks. In this paper, we develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.07049

  15. arXiv:2007.04842  [pdf, other]

    cs.RO cs.CG

    An Interior Point Method Solving Motion Planning Problems with Narrow Passages

    Authors: Jim Mainprice, Nathan Ratliff, Marc Toussaint, Stefan Schaal

    Abstract: Algorithmic solutions for the motion planning problem have been investigated for five decades. Since the development of A* in 1969 many approaches have been investigated, traditionally classified as either grid decomposition, potential fields or sampling-based. In this work, we focus on using numerical optimization, which is understudied for solving motion planning problems. This lack of interest…

    Submitted 24 July, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: IEEE RO-MAN 2020, 6 pages

  16. Model-Based Generalization Under Parameter Uncertainty Using Path Integral Control

    Authors: Ian Abraham, Ankur Handa, Nathan Ratliff, Kendall Lowrey, Todd D. Murphey, Dieter Fox

    Abstract: This work addresses the problem of robot interaction in complex environments where online control and adaptation is necessary. By expanding the sample space in the free energy formulation of path integral control, we derive a natural extension to the path integral control that embeds uncertainty into action and provides robustness for model-based robot planning. Our algorithm is applied to a diver…

    Submitted 4 June, 2020; originally announced June 2020.

    Journal ref: IEEE Robotics and Automation Letters (Volume: 5, Issue: 2, April 2020)

  17. arXiv:2005.13143  [pdf, other]

    cs.RO cs.LG eess.SY

    Euclideanizing Flows: Diffeomorphic Reduction for Learning Stable Dynamical Systems

    Authors: Muhammad Asif Rana, Anqi Li, Dieter Fox, Byron Boots, Fabio Ramos, Nathan Ratliff

    Abstract: Robotic tasks often require motions with complex geometric structures. We present an approach to learn such motions from a limited number of human demonstrations by exploiting the regularity properties of human motions e.g. stability, smoothness, and boundedness. The complex motions are encoded as rollouts of a stable dynamical system, which, under a change of coordinates defined by a diffeomorphi…

    Submitted 21 September, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 2nd Annual Conference on Learning for Dynamics and Control (L4DC) 2020 -- Revised Version

  18. arXiv:2005.10872  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning

    Authors: Michelle A. Lee, Carlos Florensa, Jonathan Tremblay, Nathan Ratliff, Animesh Garg, Fabio Ramos, Dieter Fox

    Abstract: Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state. On the other hand, reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle. In this w…

    Submitted 26 May, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Journal ref: International Conference in Robotics and Automation 2020

  19. arXiv:1910.04339  [pdf, other]

    cs.RO

    Collaborative Behavior Models for Optimized Human-Robot Teamwork

    Authors: Adam Fishman, Chris Paxton, Wei Yang, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: Effective human-robot collaboration requires informed anticipation. The robot must anticipate the human's actions, but also react quickly and intuitively when its predictions are wrong. The robot must plan its actions to account for the human's own plan, with the knowledge that the human's behavior will change based on what the robot actually does. This cyclical game of predicting a human's future…

    Submitted 3 September, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: 3 figures, 7 pages

  20. arXiv:1910.03135  [pdf, other]

    cs.CV cs.LG cs.RO

    DexPilot: Vision Based Teleoperation of Dexterous Robotic Hand-Arm System

    Authors: Ankur Handa, Karl Van Wyk, Wei Yang, Jacky Liang, Yu-Wei Chao, Qian Wan, Stan Birchfield, Nathan Ratliff, Dieter Fox

    Abstract: Teleoperation offers the possibility of imparting robotic systems with sophisticated reasoning skills, intuition, and creativity to perform tasks. However, current teleoperation solutions for high degree-of-actuation (DoA), multi-fingered robots are generally cost-prohibitive, while low-cost offerings usually provide reduced degrees of control. Herein, a low-cost, vision based teleoperation system…

    Submitted 14 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: 17 pages, first version of DexPilot

  21. arXiv:1910.02646  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Riemannian Motion Policy Fusion through Learnable Lyapunov Function Reshaping

    Authors: Mustafa Mukadam, Ching-An Cheng, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: RMPflow is a recently proposed policy-fusion framework based on differential geometry. While RMPflow has demonstrated promising performance, it requires the user to provide sensible subtask policies as Riemannian motion policies (RMPs: a motion policy and an importance matrix function), which can be a difficult design problem in its own right. We propose RMPfusion, a variation of RMPflow, to addre…

    Submitted 8 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning (CoRL), 2019

  22. arXiv:1909.12329  [pdf, other]

    cs.RO

    Scaling Local Control to Large-Scale Topological Navigation

    Authors: Xiangyun Meng, Nathan Ratliff, Yu Xiang, Dieter Fox

    Abstract: Visual topological navigation has been revitalized recently thanks to the advancement of deep learning that substantially improves robot perception. However, scalability and reliability issues remain challenging due to the complexity and ambiguity of real world images and mechanical constraints of real robots. We present an intuitive solution to show that by accurately measuring the capability…

    Submitted 18 March, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

  23. arXiv:1908.01896  [pdf, other]

    cs.RO

    Representing Robot Task Plans as Robust Logical-Dynamical Systems

    Authors: Chris Paxton, Nathan Ratliff, Clemens Eppner, Dieter Fox

    Abstract: It is difficult to create robust, reusable, and reactive behaviors for robots that can be easily extended and combined. Frameworks such as Behavior Trees are flexible but difficult to characterize, especially when designing reactions and recovery behaviors to consistently converge to a desired goal condition. We propose a framework which we call Robust Logical-Dynamical Systems (RLDS), which combi…

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: 9 pages, extended version of IROS 2019 paper

  24. arXiv:1904.01762  [pdf, other]

    cs.RO

    Neural Autonomous Navigation with Riemannian Motion Policy

    Authors: Xiangyun Meng, Nathan Ratliff, Yu Xiang, Dieter Fox

    Abstract: End-to-end learning for autonomous navigation has received substantial attention recently as a promising method for reducing modeling error. However, its data complexity, especially around generalization to unseen environments, is high. We introduce a novel image-based autonomous navigation technique that leverages policy structure using the Riemannian Motion Policy (RMP) framework for deep lea…

    Submitted 3 April, 2019; originally announced April 2019.

  25. arXiv:1903.03699  [pdf, other]

    cs.RO

    Joint Inference of Kinematic and Force Trajectories with Visuo-Tactile Sensing

    Authors: Alexander Lambert, Mustafa Mukadam, Balakumar Sundaralingam, Nathan Ratliff, Byron Boots, Dieter Fox

    Abstract: To perform complex tasks, robots must be able to interact with and manipulate their surroundings. One of the key challenges in accomplishing this is robust state estimation during physical interactions, where the state involves not only the robot and the object being manipulated, but also the state of the contact itself. In this work, within the context of planar pushing, we extend previous infere…

    Submitted 8 March, 2019; originally announced March 2019.

  26. arXiv:1811.07049  [pdf, other]

    cs.RO eess.SY

    RMPflow: A Computational Graph for Automatic Motion Policy Generation

    Authors: Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: We develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies designed to parameterize non-Euclidean behaviors as dynamical systems in intrinsically nonlinear task spaces. Given a set of RMPs designed for individual tasks, RMPflow can consistently combine these local polic…

    Submitted 5 April, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: WAFR 2018

  27. Learning Latent Space Dynamics for Tactile Servoing

    Authors: Giovanni Sutanto, Nathan Ratliff, Balakumar Sundaralingam, Yevgen Chebotar, Zhe Su, Ankur Handa, Dieter Fox

    Abstract: To achieve dexterous robotic manipulation, we need to endow our robot with tactile feedback capability, i.e. the ability to drive action based on tactile sensing. In this paper, we specifically address the challenge of tactile servoing, i.e. given the current tactile sensing and a target/goal tactile sensing --memorized from a successful task execution in the past-- what is the action that will…

    Submitted 15 April, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted to be published at the International Conference on Robotics and Automation (ICRA) 2019. The final version for publication at ICRA 2019 is 7 pages (i.e. 6 pages of technical content (including text, figures, tables, acknowledgement, etc.) and 1 page of the Bibliography/References), while this arXiv version is 8 pages (added Appendix and some extra details)

  28. arXiv:1810.06509  [pdf, other]

    cs.LG stat.ML

    Predictor-Corrector Policy Optimization

    Authors: Ching-An Cheng, Xinyan Yan, Nathan Ratliff, Byron Boots

    Abstract: We present a predictor-corrector framework, called PicCoLO, that can transform a first-order model-free reinforcement or imitation learning algorithm into a new hybrid method that leverages predictive models to accelerate policy learning. The new "PicCoLOed" algorithm optimizes a policy by recursively repeating two steps: In the Prediction Step, the learner uses a model to predict the unseen futur…

    Submitted 24 May, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

  29. arXiv:1810.06187  [pdf, other]

    cs.RO

    Robust Learning of Tactile Force Estimation through Robot Interaction

    Authors: Balakumar Sundaralingam, Alexander Lambert, Ankur Handa, Byron Boots, Tucker Hermans, Stan Birchfield, Nathan Ratliff, Dieter Fox

    Abstract: Current methods for estimating force from tactile sensor signals are either inaccurate analytic models or task-specific learned models. In this paper, we explore learning a robust model that maps tactile sensor signals to force. We specifically explore learning a mapping for the SynTouch BioTac sensor via neural networks. We propose a voxelized input feature layer for spatial signals and leverage…

    Submitted 5 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: accepted to ICRA 2019 (camera ready version)

  30. arXiv:1810.05687  [pdf, other]

    cs.RO cs.LG

    Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience

    Authors: Yevgen Chebotar, Ankur Handa, Viktor Makoviychuk, Miles Macklin, Jan Issac, Nathan Ratliff, Dieter Fox

    Abstract: We consider the problem of transferring policies to the real world by training on a distribution of simulated scenarios. Rather than manually tuning the randomization of simulations, we adapt the simulation parameter distribution using a few real world roll-outs interleaved with policy training. In doing so, we are able to change the distribution of simulations to improve the policy transfer by ma…

    Submitted 5 March, 2019; v1 submitted 12 October, 2018; originally announced October 2018.

  31. arXiv:1801.02854  [pdf, other]

    cs.RO

    Riemannian Motion Policies

    Authors: Nathan D. Ratliff, Jan Issac, Daniel Kappler, Stan Birchfield, Dieter Fox

    Abstract: We introduce the Riemannian Motion Policy (RMP), a new mathematical object for modular motion generation. An RMP is a second-order dynamical system (acceleration field or motion policy) coupled with a corresponding Riemannian metric. The motion policy maps positions and velocities to accelerations, while the metric captures the directions in the space important to the policy. We show that RMPs pro…

    Submitted 25 July, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

  32. arXiv:1710.02513  [pdf, other]

    cs.RO

    A New Data Source for Inverse Dynamics Learning

    Authors: Daniel Kappler, Franziska Meier, Nathan Ratliff, Stefan Schaal

    Abstract: Modern robotics is gravitating toward increasingly collaborative human-robot interaction. Tools such as acceleration policies can naturally support the realization of reactive, adaptive, and compliant robots. These tools require us to model the system dynamics accurately -- a difficult task. The fundamental problem remains that simulation and reality diverge -- we do not know how to accurately chang…

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: IROS 2017

  33. arXiv:1703.03512  [pdf, other]

    cs.RO

    Real-time Perception meets Reactive Motion Generation

    Authors: Daniel Kappler, Franziska Meier, Jan Issac, Jim Mainprice, Cristina Garcia Cifuentes, Manuel Wüthrich, Vincent Berenz, Stefan Schaal, Nathan Ratliff, Jeannette Bohg

    Abstract: We address the challenging problem of robotic grasping and manipulation in the presence of uncertainty. This uncertainty is due to noisy sensing, inaccurate models and hard-to-predict environment dynamics. We quantify the importance of continuous, real-time perception and its tight integration with reactive motion generation methods in dynamic manipulation scenarios. We compare three different sys…

    Submitted 6 October, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

  34. arXiv:1608.00309  [pdf, other]

    cs.RO

    DOOMED: Direct Online Optimization of Modeling Errors in Dynamics

    Authors: Nathan Ratliff, Franziska Meier, Daniel Kappler, Stefan Schaal

    Abstract: It has long been hoped that model-based control will improve tracking performance while maintaining or increasing compliance. This hope hinges on having or being able to estimate an accurate inverse dynamics model. As a result, substantial effort has gone into modeling and estimating dynamics (error) models. Most recent research has focused on learning the true inverse dynamics using data points m…

    Submitted 9 August, 2016; v1 submitted 31 July, 2016; originally announced August 2016.

    Comments: Added an acknowledgements section

  35. arXiv:1605.09296  [pdf, other]

    cs.RO

    On the Fundamental Importance of Gauss-Newton in Motion Optimization

    Authors: Nathan Ratliff, Marc Toussaint, Jeannette Bohg, Stefan Schaal

    Abstract: Hessian information speeds convergence substantially in motion optimization. The better the Hessian approximation the better the convergence. But how good is a given approximation theoretically? How much are we losing? This paper addresses that question and proves that for a particularly popular and empirically strong approximation known as the Gauss-Newton approximation, we actually lose very lit…

    Submitted 30 May, 2016; originally announced May 2016.

  36. arXiv:1503.06375  [pdf, other]

    cs.RO

    Policy Learning with Hypothesis based Local Action Selection

    Authors: Bharath Sankaran, Jeannette Bohg, Nathan Ratliff, Stefan Schaal

    Abstract: To manipulate in unknown and unstructured environments, a robot should be capable of operating under partial observability of the environment. Object occlusions and unmodeled environments are some of the factors that result in partial observability. A common scenario where this is encountered is manipulation in clutter. In the case that the robot needs to locate an object of…

    Submitted 8 May, 2015; v1 submitted 21 March, 2015; originally announced March 2015.

    Comments: RLDM abstract

  37. arXiv:1202.3702  [pdf]

    cs.LG stat.ML

    Semi-supervised Learning with Density Based Distances

    Authors: Avleen S. Bijral, Nathan Ratliff, Nathan Srebro

    Abstract: We present a simple, yet effective, approach to Semi-Supervised Learning. Our approach is based on estimating density-based distances (DBD) using a shortest path calculation on a graph. These Graph-DBD estimates can then be used in any distance-based supervised learning method, such as Nearest Neighbor methods and SVMs with RBF kernels. In order to apply the method to very large data sets, we also…

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-43-50