Skip to main content

Showing 51–100 of 152 results for author: How, J

.
  1. arXiv:2203.03535  [pdf, other

    cs.LG cs.AI cs.MA

    Influencing Long-Term Behavior in Multiagent Reinforcement Learning

    Authors: Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How

    Abstract: The main challenge of multiagent reinforcement learning is the difficulty of learning useful policies in the presence of other simultaneously learning agents whose changing behaviors jointly affect the environment's transition and reward dynamics. An effective approach that has recently emerged for addressing this non-stationarity is for each agent to anticipate the learning of other agents and in… ▽ More

    Submitted 15 October, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted to NeurIPS 2022. The earlier version was presented at the Gamification and Multiagent Solutions Workshop (ICLR 2022) with a spotlight. Code at https://github.com/dkkim93/further and videos at https://sites.google.com/view/further-marl

  2. arXiv:2203.00851  [pdf, other

    cs.RO math.OC

    Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation

    Authors: Yulun Tian, Amrit Singh Bedi, Alec Koppel, Miguel Calvo-Fullana, David M. Rosen, Jonathan P. How

    Abstract: We present the first distributed optimization algorithm with lazy communication for collaborative geometric estimation, the backbone of modern collaborative simultaneous localization and map** (SLAM) and structure-from-motion (SfM) applications. Our method allows agents to cooperatively reconstruct a shared geometric model on a central server by fusing individual observations, but without the ne… ▽ More

    Submitted 29 July, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: technical report (17 pages, 3 figures); to appear at IROS 2022

  3. arXiv:2201.07372  [pdf, other

    cs.LG cs.AI

    Prospective Learning: Principled Extrapolation to the Future

    Authors: Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller , et al. (18 additional authors not shown)

    Abstract: Learning is a process which can update decision rules, based on past experience, such that future performance improves. Traditionally, machine learning is often evaluated under the assumption that the future will be identical to the past in distribution or change adversarially. But these assumptions can be either too optimistic or pessimistic for many problems in the real world. Real world scenari… ▽ More

    Submitted 13 July, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted at the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  4. arXiv:2111.14990  [pdf, other

    cs.RO

    MIXER: A Principled Framework for Multimodal, Multiway Data Association

    Authors: Parker C. Lusk, Ronak Roy, Kaveh Fathian, Jonathan P. How

    Abstract: A fundamental problem in robotic perception is matching identical objects or data, with applications such as loop closure detection, place recognition, object tracking, and map fusion. While the problem becomes considerably more challenging when matching should be done jointly across multiple, multimodal sets of data, the robustness and accuracy of matching in the presence of noise and outliers ca… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: presented in ICRA 2021 Workshop on Robust Perception for Autonomous Field Robots in Challenging Environments

  5. arXiv:2110.00876  [pdf, other

    cs.RO cs.LG stat.AP

    Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows

    Authors: Qiangqiang Huang, Can Pu, Kasra Khosoussi, David M. Rosen, Dehann Fourie, Jonathan P. How, John J. Leonard

    Abstract: This paper presents normalizing flows for incremental smoothing and map** (NF-iSAM), a novel algorithm for inferring the full posterior distribution in SLAM problems with nonlinear measurement models and non-Gaussian factors. NF-iSAM exploits the expressive power of neural networks, and trains normalizing flows to model and sample the full posterior. By leveraging the Bayes tree, NF-iSAM enables… ▽ More

    Submitted 2 July, 2022; v1 submitted 2 October, 2021; originally announced October 2021.

    Comments: Extension of work published at arXiv:2105.05045

  6. arXiv:2109.09910  [pdf, other

    cs.RO cs.LG

    Demonstration-Efficient Guided Policy Search via Imitation of Robust Tube MPC

    Authors: Andrea Tagliabue, Dong-Ki Kim, Michael Everett, Jonathan P. How

    Abstract: We propose a demonstration-efficient strategy to compress a computationally expensive Model Predictive Controller (MPC) into a more computationally efficient representation based on a deep neural network and Imitation Learning (IL). By generating a Robust Tube variant (RTMPC) of the MPC and leveraging properties from the tube, we introduce a data augmentation method that enables high demonstration… ▽ More

    Submitted 23 September, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Submitted to the 2022 IEEE Conference on Robotics and Automation (ICRA). Video: https://youtu.be/28zQFktJIqg

  7. arXiv:2109.09876  [pdf, other

    cs.LG cs.AI

    Context-Specific Representation Abstraction for Deep Option Learning

    Authors: Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How

    Abstract: Hierarchical reinforcement learning has focused on discovering temporally extended actions, such as options, that can provide benefits in problems requiring extensive exploration. One promising approach that learns these options end-to-end is the option-critic (OC) framework. We examine and show in this paper that OC does not decompose a problem into simpler sub-problems, but instead increases the… ▽ More

    Submitted 23 April, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted at AAAI 2022

  8. arXiv:2109.06795  [pdf, other

    cs.LG

    ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation

    Authors: Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How

    Abstract: In a multirobot system, a number of cyber-physical attacks (e.g., communication hijack, observation perturbations) can challenge the robustness of agents. This robustness issue worsens in multiagent reinforcement learning because there exists the non-stationarity of the environment caused by simultaneously learning agents whose changing policies affect the transition and reward functions. In this… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

  9. arXiv:2108.04140  [pdf, other

    eess.SY cs.LG cs.RO

    Reachability Analysis of Neural Feedback Loops

    Authors: Michael Everett, Golnaz Habibi, Chuangchuang Sun, Jonathan P. How

    Abstract: Neural Networks (NNs) can provide major empirical performance improvements for closed-loop systems, but they also introduce challenges in formally analyzing those systems' safety properties. In particular, this work focuses on estimating the forward reachable set of \textit{neural feedback loops} (closed-loop systems with NN controllers). Recent work provides bounds on these reachable sets, but th… ▽ More

    Submitted 2 February, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

  10. arXiv:2106.14386  [pdf, other

    cs.RO cs.CV cs.MA

    Kimera-Multi: Robust, Distributed, Dense Metric-Semantic SLAM for Multi-Robot Systems

    Authors: Yulun Tian, Yun Chang, Fernando Herrera Arias, Carlos Nieto-Granda, Jonathan P. How, Luca Carlone

    Abstract: This paper presents Kimera-Multi, the first multi-robot system that (i) is robust and capable of identifying and rejecting incorrect inter and intra-robot loop closures resulting from perceptual aliasing, (ii) is fully distributed and only relies on local (peer-to-peer) communication to achieve distributed localization and map**, and (iii) builds a globally consistent metric-semantic 3D mesh mod… ▽ More

    Submitted 17 December, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE Transactions on Robotics (18 pages, 15 figures)

  11. arXiv:2105.13506  [pdf, other

    cs.RO

    Airflow-Inertial Odometry for Resilient State Estimation on Multirotors

    Authors: Andrea Tagliabue, Jonathan P. How

    Abstract: We present a dead reckoning strategy for increased resilience to position estimation failures on multirotors, using only data from a low-cost IMU and novel, bio-inspired airflow sensors. The goal is challenging, since low-cost IMUs are subject to large noise and drift, while 3D airflow sensing is made difficult by the interference caused by the propellers and by the wind. Our approach relies on a… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted to the 2021 International Conference on Robotics and Automation (ICRA 2021). Contains minor updates in Fig. 2 and Section IV.b, VII.E, VII.F

  12. arXiv:2105.05045  [pdf, other

    cs.RO

    NF-iSAM: Incremental Smoothing and Map** via Normalizing Flows

    Authors: Qiangqiang Huang, Can Pu, Dehann Fourie, Kasra Khosoussi, Jonathan P. How, John J. Leonard

    Abstract: This paper presents a novel non-Gaussian inference algorithm, Normalizing Flow iSAM (NF-iSAM), for solving SLAM problems with non-Gaussian factors and/or non-linear measurement models. NF-iSAM exploits the expressive power of neural networks, and trains normalizing flows to draw samples from the joint posterior of non-Gaussian factor graphs. By leveraging the Bayes tree, NF-iSAM is able to exploit… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: 8 pages, 6 figures, to be published in IEEE International Conference on Robotics and Automation (ICRA) 2021

  13. arXiv:2103.14805  [pdf, other

    cs.RO cs.MA

    Multi-Robot Distributed Semantic Map** in Unfamiliar Environments through Online Matching of Learned Representations

    Authors: Stewart Jamieson, Kaveh Fathian, Kasra Khosoussi, Jonathan P. How, Yogesh Girdhar

    Abstract: We present a solution to multi-robot distributed semantic map** of novel and unfamiliar environments. Most state-of-the-art semantic map** systems are based on supervised learning algorithms that cannot classify novel observations online. While unsupervised learning algorithms can invent labels for novel observations, approaches to detect when multiple robots have independently developed their… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

    Comments: 7 pages, 6 figures, 1 table; accepted for presentation in IEEE Int. Conf. on Robotics and Automation, ICRA '21, Xi'an, China, June 2021

  14. PANTHER: Perception-Aware Trajectory Planner in Dynamic Environments

    Authors: Jesus Tordesillas, Jonathan P. How

    Abstract: This paper presents PANTHER, a real-time perception-aware (PA) trajectory planner for multirotor-UAVs (Unmanned Aerial Vehicles) in dynamic environments. PANTHER plans trajectories that avoid dynamic obstacles while also kee** them in the sensor field of view (FOV) and minimizing the blur to aid in object tracking. The rotation and translation of the UAV are jointly optimized, which allows PANTH… ▽ More

    Submitted 22 March, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 16 pages

  15. arXiv:2102.13073  [pdf, other

    cs.RO cs.LG

    Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians

    Authors: Bruno Brito, Michael Everett, Jonathan P. How, Javier Alonso-Mora

    Abstract: Robotic navigation in environments shared with other robots or humans remains challenging because the intentions of the surrounding agents are not directly observable and the environment conditions are continuously changing. Local trajectory optimization methods, such as model predictive control (MPC), can deal with those changes but require global guidance, which is not trivial to obtain in crowd… ▽ More

    Submitted 26 February, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: 8 pages, 6 figures

  16. arXiv:2102.03443  [pdf, other

    cs.RO

    LION: Lidar-Inertial Observability-Aware Navigator for Vision-Denied Environments

    Authors: Andrea Tagliabue, Jesus Tordesillas, Xiaoyi Cai, Angel Santamaria-Navarro, Jonathan P. How, Luca Carlone, Ali-akbar Agha-mohammadi

    Abstract: State estimation for robots navigating in GPS-denied and perceptually-degraded environments, such as underground tunnels, mines and planetary subsurface voids, remains challenging in robotics. Towards this goal, we present LION (Lidar-Inertial Observability-Aware Navigator), which is part of the state estimation framework developed by the team CoSTAR for the DARPA Subterranean Challenge, where the… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: 2020 International Symposium on Experimental Robotics (ISER 2020)

  17. arXiv:2101.11093  [pdf, other

    cs.RO cs.MA

    Non-Monotone Energy-Aware Information Gathering for Heterogeneous Robot Teams

    Authors: Xiaoyi Cai, Brent Schlotfeldt, Kasra Khosoussi, Nikolay Atanasov, George J. Pappas, Jonathan P. How

    Abstract: This paper considers the problem of planning trajectories for a team of sensor-equipped robots to reduce uncertainty about a dynamical process. Optimizing the trade-off between information gain and energy cost (e.g., control effort, distance travelled) is desirable but leads to a non-monotone objective function in the set of robot trajectories. Therefore, common multi-robot planning algorithms bas… ▽ More

    Submitted 26 March, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: To appear in ICRA 2021. Video: https://www.youtube.com/watch?v=xWgFi6fwex0

  18. arXiv:2101.01815  [pdf, other

    eess.SY cs.LG cs.RO

    Efficient Reachability Analysis of Closed-Loop Systems with Neural Network Controllers

    Authors: Michael Everett, Golnaz Habibi, Jonathan P. How

    Abstract: Neural Networks (NNs) can provide major empirical performance improvements for robotic systems, but they also introduce challenges in formally analyzing those systems' safety properties. In particular, this work focuses on estimating the forward reachable set of closed-loop systems with NN controllers. Recent work provides bounds on these reachable sets, yet the computationally efficient approache… ▽ More

    Submitted 24 May, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

  19. arXiv:2012.12403  [pdf, other

    eess.SY

    Performance Analysis of Adaptive Dynamic Tube MPC

    Authors: Savva Morozov, Parker C. Lusk, Brett T. Lopez, Jonathan P. How

    Abstract: Model predictive control (MPC) is an effective method for control of constrained systems but is susceptible to the external disturbances and modeling error often encountered in real-world applications. To address these issues, techniques such as Tube MPC (TMPC) utilize an ancillary offline-generated robust controller to ensure that the system remains within an invariant set, referred to as a tube,… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 14 main pages, 2 additional pages, accepted to 2021 AIAA SciTech Forum

  20. arXiv:2011.10202  [pdf, other

    cs.RO cs.CV

    CLIPPER: A Graph-Theoretic Framework for Robust Data Association

    Authors: Parker C. Lusk, Kaveh Fathian, Jonathan P. How

    Abstract: We present CLIPPER (Consistent LInking, Pruning, and Pairwise Error Rectification), a framework for robust data association in the presence of noise and outliers. We formulate the problem in a graph-theoretic framework using the notion of geometric consistency. State-of-the-art techniques that use this framework utilize either combinatorial optimization techniques that do not scale well to large-s… ▽ More

    Submitted 9 April, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

    Comments: accepted ICRA'21

  21. Kimera-Multi: a System for Distributed Multi-Robot Metric-Semantic Simultaneous Localization and Map**

    Authors: Yun Chang, Yulun Tian, Jonathan P. How, Luca Carlone

    Abstract: We present the first fully distributed multi-robot system for dense metric-semantic Simultaneous Localization and Map** (SLAM). Our system, dubbed Kimera-Multi, is implemented by a team of robots equipped with visual-inertial sensors, and builds a 3D mesh model of the environment in real-time, where each face of the mesh is annotated with a semantic label (e.g., building, road, objects). In Kime… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: 9 pages

  22. arXiv:2011.00382  [pdf, other

    cs.LG cs.AI cs.MA

    A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

    Authors: Dong-Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How

    Abstract: A fundamental challenge in multiagent reinforcement learning is to learn beneficial behaviors in a shared environment with other simultaneously learning agents. In particular, each agent perceives the environment as effectively non-stationary due to the changing policies of other agents. Moreover, each agent is itself constantly learning, leading to natural non-stationarity in the distribution of… ▽ More

    Submitted 11 June, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

    Comments: Accepted to ICML 2021. Code at https://github.com/dkkim93/meta-mapg and Videos at https://sites.google.com/view/meta-mapg/home

  23. arXiv:2010.11061  [pdf, other

    cs.RO cs.MA

    MADER: Trajectory Planner in Multi-Agent and Dynamic Environments

    Authors: Jesus Tordesillas, Jonathan P. How

    Abstract: This paper presents MADER, a 3D decentralized and asynchronous trajectory planner for UAVs that generates collision-free trajectories in environments with static obstacles, dynamic obstacles, and other planning agents. Real-time collision avoidance with other dynamic obstacles or agents is done by performing outer polyhedral representations of every interval of the trajectories and then including… ▽ More

    Submitted 15 April, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 15 pages, 15 figures

  24. arXiv:2010.10726  [pdf, other

    cs.CG cs.GR cs.RO

    MINVO Basis: Finding Simplexes with Minimum Volume Enclosing Polynomial Curves

    Authors: Jesus Tordesillas, Jonathan P. How

    Abstract: This paper studies the polynomial basis that generates the smallest $n$-simplex enclosing a given $n^{\text{th}}$-degree polynomial curve in $\mathbb{R}^n$. Although the Bernstein and B-Spline polynomial bases provide feasible solutions to this problem, the simplexes obtained by these bases are not the smallest possible, which leads to overly conservative results in many CAD (computer-aided design… ▽ More

    Submitted 26 September, 2022; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 27 pages, 25 figures

  25. arXiv:2010.00540  [pdf, other

    cs.LG eess.SY stat.ML

    Robustness Analysis of Neural Networks via Efficient Partitioning with Applications in Control Systems

    Authors: Michael Everett, Golnaz Habibi, Jonathan P. How

    Abstract: Neural networks (NNs) are now routinely implemented on systems that must operate in uncertain environments, but the tools for formally analyzing how this uncertainty propagates to NN outputs are not yet commonplace. Computing tight bounds on NN output sets (given an input set) provides a measure of confidence associated with the NN decisions and is essential to deploy NNs on safety-critical system… ▽ More

    Submitted 7 December, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

  26. arXiv:2007.07702  [pdf, other

    cs.CV cs.RO eess.IV eess.SY

    Lunar Terrain Relative Navigation Using a Convolutional Neural Network for Visual Crater Detection

    Authors: Lena M. Downes, Ted J. Steiner, Jonathan P. How

    Abstract: Terrain relative navigation can improve the precision of a spacecraft's position estimate by detecting global features that act as supplementary measurements to correct for drift in the inertial navigation system. This paper presents a system that uses a convolutional neural network (CNN) and image processing methods to track the location of a simulated spacecraft with an extended Kalman filter (E… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 6 pages, 4 figures. This work was accepted by the 2020 American Control Conference

  27. arXiv:2006.11419  [pdf, other

    cs.LG cs.AI stat.ML

    FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimize

    Authors: Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How

    Abstract: This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation monotonically decrease, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guarant… ▽ More

    Submitted 5 May, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Accepted to ICML 2020 Workshop Theoretical Foundations of RL; Accepted to ICRA 2021

  28. arXiv:2006.01109  [pdf, other

    cs.RO math.OC

    Collision Probabilities for Continuous-Time Systems Without Sampling [with Appendices]

    Authors: Kristoffer M. Frey, Ted J. Steiner, Jonathan P. How

    Abstract: Demand for high-performance, robust, and safe autonomous systems has grown substantially in recent years. These objectives motivate the desire for efficient safety-theoretic reasoning that can be embedded in core decision-making tasks such as motion planning, particularly in constrained environments. On one hand, Monte-Carlo (MC) and other sampling-based techniques provide accurate collision proba… ▽ More

    Submitted 24 December, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Presented at RSS 2020. Updated version contains restructured proofs and analysis, as well as as a number of notational tweaks throughout

  29. arXiv:2004.06496  [pdf, other

    cs.LG cs.CR stat.ML

    Certifiable Robustness to Adversarial State Uncertainty in Deep Reinforcement Learning

    Authors: Michael Everett, Bjorn Lutjens, Jonathan P. How

    Abstract: Deep Neural Network-based systems are now the state-of-the-art in many robotics tasks, but their application in safety-critical domains remains dangerous without formal guarantees on network robustness. Small perturbations to sensor inputs (from noise or adversarial examples) are often enough to change network-based decisions, which was recently shown to cause an autonomous vehicle to swerve into… ▽ More

    Submitted 2 February, 2022; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1910.12908

  30. arXiv:2003.10028  [pdf, other

    eess.SY

    Robust Adaptive Control Barrier Functions: An Adaptive & Data-Driven Approach to Safety (Extended Version)

    Authors: Brett T. Lopez, Jean-Jacques E. Slotine, Jonathan P. How

    Abstract: A new framework is developed for control of constrained nonlinear systems with structured parametric uncertainties. Forward invariance of a safe set is achieved through online parameter adaptation and data-driven model estimation. The new adaptive data-driven safety paradigm is merged with a recent adaptive control algorithm for systems nominally contracting in closed-loop. This unification is mor… ▽ More

    Submitted 28 May, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: Added aCBF non-Lipschitz example and discussion on approach implementation

  31. Active Reward Learning for Co-Robotic Vision Based Exploration in Bandwidth Limited Environments

    Authors: Stewart Jamieson, Jonathan P. How, Yogesh Girdhar

    Abstract: We present a novel POMDP problem formulation for a robot that must autonomously decide where to go to collect new and scientifically relevant images given a limited ability to communicate with its human operator. From this formulation we derive constraints and design principles for the observation model, reward model, and communication strategy of such a robot, exploring techniques to deal with th… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures; accepted for presentation in IEEE Int. Conf. on Robotics and Automation, ICRA '20, Paris, France, June 2020

    Journal ref: 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 2020, pp. 1806-1812

  32. arXiv:2003.03281  [pdf, ps, other

    math.OC cs.MA cs.RO

    Asynchronous and Parallel Distributed Pose Graph Optimization

    Authors: Yulun Tian, Alec Koppel, Amrit Singh Bedi, Jonathan P. How

    Abstract: We present Asynchronous Stochastic Parallel Pose Graph Optimization (ASAPP), the first asynchronous algorithm for distributed pose graph optimization (PGO) in multi-robot simultaneous localization and map**. By enabling robots to optimize their local trajectory estimates without synchronization, ASAPP offers resiliency against communication delays and alleviates the need to wait for stragglers i… ▽ More

    Submitted 30 June, 2023; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: full paper with appendices

  33. arXiv:2003.02305  [pdf, other

    cs.RO cs.LG eess.SY

    Touch the Wind: Simultaneous Airflow, Drag and Interaction Sensing on a Multirotor

    Authors: Andrea Tagliabue, Aleix Paris, Suhan Kim, Regan Kubicek, Sarah Bergbreiter, Jonathan P. How

    Abstract: Disturbance estimation for Micro Aerial Vehicles (MAVs) is crucial for robustness and safety. In this paper, we use novel, bio-inspired airflow sensors to measure the airflow acting on a MAV, and we fuse this information in an Unscented Kalman Filter (UKF) to simultaneously estimate the three-dimensional wind vector, the drag force, and other interaction forces (e.g. due to collisions, interaction… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: The first two authors contributed equally

  34. A Distributed Pipeline for Scalable, Deconflicted Formation Flying

    Authors: Parker C. Lusk, Xiaoyi Cai, Samir Wadhwania, Aleix Paris, Kaveh Fathian, Jonathan P. How

    Abstract: Reliance on external localization infrastructure and centralized coordination are main limiting factors for formation flying of vehicles in large numbers and in unprepared environments. While solutions using onboard localization address the dependency on external infrastructure, the associated coordination strategies typically lack collision avoidance and scalability. To address these shortcomings… ▽ More

    Submitted 3 July, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 8 main pages, 1 additional page, accepted to RA-L and IROS'20

  35. arXiv:2003.01040  [pdf, other

    cs.MA cs.LG

    Scaling Up Multiagent Reinforcement Learning for Robotic Systems: Learn an Adaptive Sparse Communication Graph

    Authors: Chuangchuang Sun, Macheng Shen, Jonathan P. How

    Abstract: The complexity of multiagent reinforcement learning (MARL) in multiagent systems increases exponentially with respect to the agent number. This scalability issue prevents MARL from being applied in large-scale multiagent systems. However, one critical feature in MARL that is often neglected is that the interactions between agents are quite sparse. Without exploiting this sparsity structure, existi… ▽ More

    Submitted 3 March, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  36. arXiv:2002.06684  [pdf, other

    cs.MA cs.AI

    R-MADDPG for Partially Observable Environments and Limited Communication

    Authors: Rose E. Wang, Michael Everett, Jonathan P. How

    Abstract: There are several real-world tasks that would benefit from applying multiagent reinforcement learning (MARL) algorithms, including the coordination among self-driving cars. The real world has challenging conditions for multiagent learning systems, such as its partial observable and nonstationary nature. Moreover, if agents must share a limited resource (e.g. network bandwidth) they must all learn… ▽ More

    Submitted 17 February, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

    Comments: Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 36th International Conference on Machine Learning, Long Beach, California, USA, 2019

    Journal ref: Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 36th International Conference on Machine Learning, Long Beach, California, USA, 2019

  37. arXiv:2001.06627  [pdf, other

    cs.LG cs.AI cs.RO

    Multi-agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

    Authors: Samaneh Hosseini Semnani, Hugh Liu, Michael Everett, Anton de Ruiter, Jonathan P. How

    Abstract: This paper introduces a hybrid algorithm of deep reinforcement learning (RL) and Force-based motion planning (FMP) to solve distributed motion planning problem in dense and dynamic environments. Individually, RL and FMP algorithms each have their own limitations. FMP is not able to produce time-optimal paths and existing RL solutions are not able to produce collision-free paths in dense environmen… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: IEEE Robotics and Automation Letters (2020)

  38. arXiv:2001.04420  [pdf, other

    cs.RO cs.CV

    FASTER: Fast and Safe Trajectory Planner for Navigation in Unknown Environments

    Authors: Jesus Tordesillas, Brett T. Lopez, Michael Everett, Jonathan P. How

    Abstract: Planning high-speed trajectories for UAVs in unknown environments requires algorithmic techniques that enable fast reaction times to guarantee safety as more information about the environment becomes available. The standard approaches that ensure safety by enforcing a "stop" condition in the free-known space can severely limit the speed of the vehicle, especially in situations where much of the wo… ▽ More

    Submitted 30 August, 2021; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: This paper has been accepted for publication in IEEE Transactions on Robotics. arXiv admin note: text overlap with arXiv:1903.03558

  39. arXiv:1911.09476  [pdf, other

    cs.RO cs.AI cs.LG

    Incremental Learning of Motion Primitives for Pedestrian Trajectory Prediction at Intersections

    Authors: Golnaz Habibi, Nikita Japuria, Jonathan P. How

    Abstract: This paper presents a novel incremental learning algorithm for pedestrian motion prediction, with the ability to improve the learned model over time when data is incrementally available. In this setup, trajectories are modeled as simple segments called motion primitives. Transitions between motion primitives are modeled as Gaussian Processes. When new data is available, the motion primitives learn… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  40. arXiv:1911.03721  [pdf, other

    math.OC cs.RO

    Distributed Certifiably Correct Pose-Graph Optimization

    Authors: Yulun Tian, Kasra Khosoussi, David M. Rosen, Jonathan P. How

    Abstract: This paper presents the first certifiably correct algorithm for distributed pose-graph optimization (PGO), the backbone of modern collaborative simultaneous localization and map** (CSLAM) and camera network localization (CNL) systems. Our method is based upon a sparse semidefinite relaxation that we prove provides globally-optimal PGO solutions under moderate measurement noise (matching the guar… ▽ More

    Submitted 18 May, 2021; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Updated convergence proofs. Paper accepted at T-RO

  41. arXiv:1910.12908  [pdf, other

    cs.RO cs.AI cs.LG

    Certified Adversarial Robustness for Deep Reinforcement Learning

    Authors: Björn Lütjens, Michael Everett, Jonathan P. How

    Abstract: Deep Neural Network-based systems are now the state-of-the-art in many robotics tasks, but their application in safety-critical domains remains dangerous without formal guarantees on network robustness. Small perturbations to sensor inputs (from noise or adversarial examples) are often enough to change network-based decisions, which was already shown to cause an autonomous vehicle to swerve into o… ▽ More

    Submitted 6 March, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Published at Conference on Robot Learning (CoRL) 2019; (v2) contains minor updates to related works; (v3) acknowledged AWS

    Journal ref: Proceedings of Machine Learning Research (PMLR) Vol. 100, 2019

  42. Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning

    Authors: Michael Everett, Yu Fan Chen, Jonathan P. How

    Abstract: Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over sho… ▽ More

    Submitted 25 January, 2021; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.01956

  43. arXiv:1910.10763  [pdf, ps, other

    cs.SI

    Representation Learning in Heterogeneous Professional Social Networks with Ambiguous Social Connections

    Authors: Baoxu Shi, Jaewon Yang, Tim Weninger, **g How, Qi He

    Abstract: Network representations have been shown to improve performance within a variety of tasks, including classification, clustering, and link prediction. However, most models either focus on moderate-sized, homogeneous networks or require a significant amount of auxiliary input to be provided by the user. Moreover, few works have studied network representations in real-world heterogeneous social networ… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 10 pages, accepted at IEEE BigData 2019

  44. arXiv:1909.11071  [pdf, other

    eess.SY

    Dynamic Landing of an Autonomous Quadrotor on a Moving Platform in Turbulent Wind Conditions

    Authors: Aleix Paris, Brett T. Lopez, Jonathan P. How

    Abstract: Autonomous landing on a moving platform presents unique challenges for multirotor vehicles, including the need to accurately localize the platform, fast trajectory planning, and precise/robust control. Previous works studied this problem but most lack explicit consideration of the wind disturbance, which typically leads to slow descents onto the platform. This work presents a fully autonomous visi… ▽ More

    Submitted 13 March, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: 7 pages, 8 figures, ICRA2020 accepted paper

  45. arXiv:1909.08735  [pdf, other

    cs.AI cs.LG

    Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games

    Authors: Macheng Shen, Jonathan P. How

    Abstract: This paper presents an algorithmic framework for learning robust policies in asymmetric imperfect-information games, where the joint reward could depend on the uncertain opponent type (a private information known only to the opponent itself and its ally). In order to maximize the reward, the protagonist agent has to infer the opponent type through agent modeling. We use multiagent reinforcement le… ▽ More

    Submitted 3 March, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

  46. arXiv:1909.05004  [pdf, other

    cs.LG cs.RO stat.ML

    Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

    Authors: Arpan Kusari, Jonathan P. How

    Abstract: A common approach for defining a reward function for Multi-objective Reinforcement Learning (MORL) problems is the weighted sum of the multiple objectives. The weights are then treated as design parameters dependent on the expertise (and preference) of the person performing the learning, with the typical result that a new solution is required for any change in these settings. This paper investigat… ▽ More

    Submitted 3 March, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Accepted at ICRA 2020

  47. arXiv:1908.10541  [pdf, other

    cs.RO

    Search and Rescue under the Forest Canopy using Multiple UAVs

    Authors: Yulun Tian, Katherine Liu, Kyel Ok, Loc Tran, Danette Allen, Nicholas Roy, Jonathan P. How

    Abstract: We present a multi-robot system for GPS-denied search and rescue under the forest canopy. Forests are particularly challenging environments for collaborative exploration and map**, in large part due to the existence of severe perceptual aliasing which hinders reliable loop closure detection for mutual localization and map fusion. Our proposed system features unmanned aerial vehicles (UAVs) that… ▽ More

    Submitted 7 June, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: IJRR revision

  48. arXiv:1908.09171  [pdf, other

    cs.RO cs.AI cs.LG

    Planning Beyond the Sensing Horizon Using a Learned Context

    Authors: Michael Everett, Justin Miller, Jonathan P. How

    Abstract: Last-mile delivery systems commonly propose the use of autonomous robotic vehicles to increase scalability and efficiency. The economic inefficiency of collecting accurate prior maps for navigation motivates the use of planning algorithms that operate in unmapped environments. However, these algorithms typically waste time exploring regions that are unlikely to contain the delivery destination. Co… ▽ More

    Submitted 1 June, 2020; v1 submitted 24 August, 2019; originally announced August 2019.

  49. arXiv:1908.03790  [pdf, other

    cs.RO

    Towards Online Observability-Aware Trajectory Optimization for Landmark-based Estimators

    Authors: Kristoffer M. Frey, Ted J. Steiner, Jonathan P. How

    Abstract: As autonomous systems increasingly rely on onboard sensing for localization and perception, the parallel tasks of motion planning and state estimation become more strongly coupled. This coupling is well-captured by augmenting the planning objective with a posterior-covariance penalty -- however, prediction of the estimator covariance is challenging when the observation model depends on unknown lan… ▽ More

    Submitted 10 September, 2020; v1 submitted 10 August, 2019; originally announced August 2019.

    Comments: Preprint; 25 pages

  50. arXiv:1907.06553  [pdf, other

    eess.SY

    Dynamic Tube MPC for Nonlinear Systems

    Authors: Brett T. Lopez, Jean-Jacques E. Slotine, Jonathan P. How

    Abstract: Modeling error or external disturbances can severely degrade the performance of Model Predictive Control (MPC) in real-world scenarios. Robust MPC (RMPC) addresses this limitation by optimizing over feedback policies but at the expense of increased computational complexity. Tube MPC is an approximate solution strategy in which a robust controller, designed offline, keeps the system in an invariant… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.