Skip to main content

Showing 1–6 of 6 results for author: Preiss, J A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.09537  [pdf, other

    cs.RO cs.AI cs.LG cs.MA eess.SY

    QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

    Authors: Zhehui Huang, Sumeet Batra, Tao Chen, Rahul Krupani, Tushar Kumar, Artem Molchanov, Aleksei Petrenko, James A. Preiss, Zhao**g Yang, Gaurav S. Sukhatme

    Abstract: Reinforcement learning (RL) has shown promise in creating robust policies for robotics tasks. However, contemporary RL algorithms are data-hungry, often requiring billions of environment transitions to train successful policies. This necessitates the use of fast and highly-parallelizable simulators. In addition to speed, such simulators need to model the physics of the robots and their interaction… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Paper published in ICRA 2023 Workshop: The Role of Robotics Simulators for Unmanned Aerial Vehicles. The workshop can be found in https://imrclab.github.io/workshop-uav-sims-icra2023/

  2. arXiv:1910.01917  [pdf, other

    cs.RO cs.AI cs.MA

    Resilient Coverage: Exploring the Local-to-Global Trade-off

    Authors: Ragesh K. Ramachandran, Lifeng Zhou James A. Preiss, Gaurav S. Sukhatme

    Abstract: We propose a centralized control framework to select suitable robots from a heterogeneous pool and place them at appropriate locations to monitor a region for events of interest. In the event of a robot failure, the framework repositions robots in a user-defined local neighborhood of the failed robot to compensate for the coverage loss. The central controller augments the team with additional robo… ▽ More

    Submitted 15 April, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: 8 pages, 5 figures, submitted to IROS 2020

  3. arXiv:1910.01249  [pdf, other

    cs.LG stat.ML

    Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator

    Authors: James A. Preiss, Sébastien M. R. Arnold, Chen-Yu Wei, Marius Kloft

    Abstract: We study the variance of the REINFORCE policy gradient estimator in environments with continuous state and action spaces, linear dynamics, quadratic cost, and Gaussian noise. These simple environments allow us to derive bounds on the estimator variance in terms of the environment and noise parameters. We compare the predictions of our bounds to the empirical variance in simulation experiments.

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 Workshop on Optimization Foundations for Reinforcement Learning. 7 pages + 6 pages appendix

  4. arXiv:1903.04856  [pdf, other

    cs.RO

    Resilience by Reconfiguration: Exploiting Heterogeneity in Robot Teams

    Authors: Ragesh K. Ramachandran, James A. Preiss, Gaurav S. Sukhatme

    Abstract: We propose a method to maintain high resource in a networked heterogeneous multi-robot system to resource failures. In our model, resources such as and computation are available on robots. The robots engaged in a joint task using these pooled resources. In our model, a resource on a particular robot becomes unavailable e.g., a sensor ceases to function due to a failure), the system reconfigures so… ▽ More

    Submitted 14 May, 2019; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures, submitted to 2019 IROS RA-Letter

  5. arXiv:1903.04628  [pdf, other

    cs.RO

    Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors

    Authors: Artem Molchanov, Tao Chen, Wolfgang Hönig, James A. Preiss, Nora Ayanian, Gaurav S. Sukhatme

    Abstract: Quadrotor stabilizing controllers often require careful, model-specific tuning for safe operation. We use reinforcement learning to train policies in simulation that transfer remarkably well to multiple different physical quadrotors. Our policies are low-level, i.e., we map the rotorcrafts' state directly to the motor outputs. The trained control policies are very robust to external disturbances a… ▽ More

    Submitted 16 April, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

  6. arXiv:1704.04852  [pdf, other

    cs.RO

    Downwash-Aware Trajectory Planning for Large Quadrotor Teams

    Authors: James A. Preiss, Wolfgang Hönig, Nora Ayanian, Gaurav S. Sukhatme

    Abstract: We describe a method for formation-change trajectory planning for large quadrotor teams in obstacle-rich environments. Our method decomposes the planning problem into two stages: a discrete planner operating on a graph representation of the workspace, and a continuous refinement that converts the non-smooth graph plan into a set of C^k-continuous trajectories, locally optimizing an integral-square… ▽ More

    Submitted 23 July, 2017; v1 submitted 16 April, 2017; originally announced April 2017.

    Comments: 8 pages