Skip to main content

Showing 1–15 of 15 results for author: Vemula, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.00694  [pdf, other

    cs.LG cs.RO eess.SY

    The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms

    Authors: Anirudh Vemula, Yuda Song, Aarti Singh, J. Andrew Bagnell, Sanjiban Choudhury

    Abstract: We propose a novel approach to addressing two fundamental challenges in Model-based Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good policy in the learned model, and the objective mismatch between model fitting and policy computation. Our "lazy" method leverages a novel unified objective, Performance Difference via Advantage in Model, to capture the performance… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  2. arXiv:2111.09434  [pdf, other

    cs.RO cs.LG eess.SY

    On the Effectiveness of Iterative Learning Control

    Authors: Anirudh Vemula, Wen Sun, Maxim Likhachev, J. Andrew Bagnell

    Abstract: Iterative learning control (ILC) is a powerful technique for high performance tracking in the presence of modeling errors for optimal control applications. There is extensive prior work showing its empirical effectiveness in applications such as chemical reactors, industrial robots and quadcopters. However, there is little prior theoretical work that explains the effectiveness of ILC even in the p… ▽ More

    Submitted 8 December, 2021; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Submitted to L4DC 2022

  3. arXiv:2109.12427  [pdf, other

    cs.RO

    Improved Soft Duplicate Detection in Search-Based Motion Planning

    Authors: Nader Maray, Anirudh Vemula, Maxim Likhachev

    Abstract: Search-based techniques have shown great success in motion planning problems such as robotic navigation by discretizing the state space and precomputing motion primitives. However in domains with complex dynamic constraints, constructing motion primitives in a discretized state space is non-trivial. This requires operating in continuous space which can be challenging for search-based planners as t… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

    Comments: submitted to ICRA2022

    MSC Class: ACM-class: I.2.9

  4. arXiv:2105.05019  [pdf, other

    cs.RO cs.LG

    Learning Optimal Decision Making for an Industrial Truck Unloading Robot using Minimal Simulator Runs

    Authors: Manash Pratim Das, Anirudh Vemula, Mayank Pathak, Sandip Aine, Maxim Likhachev

    Abstract: Consider a truck filled with boxes of varying size and unknown mass and an industrial robot with end-effectors that can unload multiple boxes from any reachable location. In this work, we investigate how would the robot with the help of a simulator, learn to maximize the number of boxes unloaded by each action. Most high-fidelity robotic simulators like ours are time-consuming. Therefore, we inves… ▽ More

    Submitted 13 March, 2021; originally announced May 2021.

    Comments: 8 pages, 8 figures, Pre-Print. This work has been submitted to the IEEE for possible publication

  5. arXiv:2009.09942  [pdf, other

    cs.RO cs.AI cs.LG

    CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models

    Authors: Anirudh Vemula, J. Andrew Bagnell, Maxim Likhachev

    Abstract: Given access to accurate dynamical models, modern planning approaches are effective in computing feasible and optimal plans for repetitive robotic tasks. However, it is difficult to model the true dynamics of the real world before execution, especially for tasks requiring interactions with objects whose parameters are unknown. A recent planning approach, CMAX, tackles this problem by adapting the… ▽ More

    Submitted 15 October, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

  6. arXiv:2004.00500  [pdf, other

    cs.LG stat.ML

    Exploration in Action Space

    Authors: Anirudh Vemula, Wen Sun, J. Andrew Bagnell

    Abstract: Parameter space exploration methods with black-box optimization have recently been shown to outperform state-of-the-art approaches in continuous control reinforcement learning domains. In this paper, we examine reasons why these methods work better and the situations in which they are worse than traditional action space exploration methods. Through a simple theoretical analysis, we show that when… ▽ More

    Submitted 30 March, 2020; originally announced April 2020.

    Comments: Presented at RSS 2018 in Learning and Inference in Robotics: Integrating Structure, Priors and Models workshop. arXiv admin note: text overlap with arXiv:1901.11503

  7. arXiv:2003.14393  [pdf, other

    cs.RO eess.SY

    TRON: A Fast Solver for Trajectory Optimization with Non-Smooth Cost Functions

    Authors: Anirudh Vemula, J. Andrew Bagnell

    Abstract: Trajectory optimization is an important tool for control and planning of complex, underactuated robots, and has shown impressive results in real world robotic tasks. However, in applications where the cost function to be optimized is non-smooth, modern trajectory optimization methods have extremely slow convergence. In this work, we present TRON, an iterative solver that can be used for efficient… ▽ More

    Submitted 31 March, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

    Comments: Submitted to CDC 2020

  8. arXiv:2003.04394  [pdf, other

    cs.RO cs.LG

    Planning and Execution using Inaccurate Models with Provable Guarantees

    Authors: Anirudh Vemula, Yash Oza, J. Andrew Bagnell, Maxim Likhachev

    Abstract: Models used in modern planning problems to simulate outcomes of real world action executions are becoming increasingly complex, ranging from simulators that do physics-based reasoning to precomputed analytical motion primitives. However, robots operating in the real world often face situations not modeled by these models before execution. This imperfect modeling can lead to highly suboptimal or ev… ▽ More

    Submitted 15 October, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: Accepted at RSS 2020. 12 pages, 5 figures. Code at https://github.com/vvanirudh/CMAX , video at https://youtu.be/eQmAeWIhjO8 and blog post at https://vvanirudh.github.io/blog/cmax

  9. arXiv:1910.12284  [pdf, other

    cs.RO

    Task-Informed Fidelity Management for Speeding Up Robotics Simulation

    Authors: Abhijeet Tallavajhula, Adrian Schoisengeier, Sung-Kyun Kim, Anirudh Vemula, Levi Lister, Oren Salzman

    Abstract: Simulators are an important tool in robotics that is used to develop robot software and generate synthetic data for machine learning algorithms. Faster simulation can result in better software validation and larger amounts of data. Previous efforts for speeding up simulators have been performed at the level of simulator building blocks, and robot systems. Our key insight, motivating this work, is… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

  10. arXiv:1910.09453  [pdf, other

    cs.RO

    Planning, Learning and Reasoning Framework for Robot Truck Unloading

    Authors: Fahad Islam, Anirudh Vemula, Sung-Kyun Kim, Andrew Dornbush, Oren Salzman, Maxim Likhachev

    Abstract: We consider the task of autonomously unloading boxes from trucks using an industrial manipulator robot. There are multiple challenges that arise: (1) real-time motion planning for a complex robotic system carrying two articulated mechanisms, an arm and a scooper, (2) decision-making in terms of what action to execute next given imperfect information about boxes such as their masses, (3) accounting… ▽ More

    Submitted 18 June, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  11. arXiv:1905.10948  [pdf, other

    cs.LG stat.ML

    Provably Efficient Imitation Learning from Observation Alone

    Authors: Wen Sun, Anirudh Vemula, Byron Boots, J. Andrew Bagnell

    Abstract: We study Imitation Learning (IL) from Observations alone (ILFO) in large-scale MDPs. While most IL algorithms rely on an expert to directly provide actions to the learner, in this setting the expert only supplies sequences of observations. We design a new model-free algorithm for ILFO, Forward Adversarial Imitation Learning (FAIL), which learns a sequence of time-dependent policies by minimizing a… ▽ More

    Submitted 11 June, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  12. arXiv:1901.11503  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective

    Authors: Anirudh Vemula, Wen Sun, J. Andrew Bagnell

    Abstract: Black-box optimizers that explore in parameter space have often been shown to outperform more sophisticated action space exploration methods developed specifically for the reinforcement learning problem. We examine these black-box methods closely to identify situations in which they are worse than action space exploration methods and those in which they are superior. Through simple theoretical ana… ▽ More

    Submitted 31 January, 2019; originally announced January 2019.

    Comments: Accepted at AISTATS 2019

  13. arXiv:1710.04689  [pdf, other

    cs.RO cs.LG

    Social Attention: Modeling Attention in Human Crowds

    Authors: Anirudh Vemula, Katharina Muelling, Jean Oh

    Abstract: Robots that navigate through human crowds need to be able to plan safe, efficient, and human predictable trajectories. This is a particularly challenging problem as it requires the robot to predict future human trajectories within a crowd where everyone implicitly cooperates with each other to avoid collisions. Previous approaches to human trajectory prediction have modeled the interactions betwee… ▽ More

    Submitted 29 October, 2018; v1 submitted 12 October, 2017; originally announced October 2017.

  14. arXiv:1705.06201  [pdf, other

    cs.RO

    Modeling Cooperative Navigation in Dense Human Crowds

    Authors: Anirudh Vemula, Katharina Muelling, Jean Oh

    Abstract: For robots to be a part of our daily life, they need to be able to navigate among crowds not only safely but also in a socially compliant fashion. This is a challenging problem because humans tend to navigate by implicitly cooperating with one another to avoid collisions, while heading toward their respective destinations. Previous approaches have used hand-crafted functions based on proximity to… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: Accepted at ICRA 2017

  15. arXiv:1605.06853  [pdf, other

    cs.RO

    Path Planning in Dynamic Environments with Adaptive Dimensionality

    Authors: Anirudh Vemula, Katharina Muelling, Jean Oh

    Abstract: Path planning in the presence of dynamic obstacles is a challenging problem due to the added time dimension in search space. In approaches that ignore the time dimension and treat dynamic obstacles as static, frequent re-planning is unavoidable as the obstacles move, and their solutions are generally sub-optimal and can be incomplete. To achieve both optimality and completeness, it is necessary to… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    Comments: Accepted in SoCS 2016