Showing 1–50 of 174 results for author: Kochenderfer, M J

Searching in archive cs.
  1. arXiv:2406.14761  [pdf, other]

    cs.RO cs.AI eess.SY

    Diffusion-Based Failure Sampling for Cyber-Physical Systems

    Authors: Harrison Delecki, Marc R. Schlichting, Mansur Arief, Anthony Corso, Marcell Vazquez-Chanlatte, Mykel J. Kochenderfer

    Abstract: Validating safety-critical autonomous systems in high-dimensional domains such as robotics presents a significant challenge. Existing black-box approaches based on Markov chain Monte Carlo may require an enormous number of samples, while methods based on importance sampling often rely on simple parametric families that may struggle to represent the distribution over failures. We propose to sample…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review at RA-L

  2. arXiv:2405.19570  [pdf, other]

    cs.MA cs.RO

    Distributed Online Planning for Min-Max Problems in Networked Markov Games

    Authors: Alexandros E. Tzikas, Jinkyoo Park, Mykel J. Kochenderfer, Ross E. Allen

    Abstract: Min-max problems are important in multi-agent sequential decision-making because they improve the performance of the worst-performing agent in the network. However, solving the multi-agent min-max problem is challenging. We propose a modular, distributed, online planning-based algorithm that is able to approximate the solution of the min-max objective in networked Markov games, assuming that the a…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to appear in the IEEE Robotics and Automation Letters

  3. arXiv:2405.09822  [pdf, other]

    cs.RO

    SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

    Authors: Muhammad Fadhil Ginting, Sung-Kyun Kim, David D. Fan, Matteo Palieri, Mykel J. Kochenderfer, Ali-akbar Agha-Mohammadi

    Abstract: This paper addresses the problem of object-goal navigation in autonomous inspections in real-world environments. Object-goal navigation is crucial to enable effective inspections in various settings, often requiring the robot to identify the target object within a large search space. Current object inspection methods fall short of human efficiency because they typically cannot bootstrap prior and…

    Submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2405.00644  [pdf, other]

    cs.AI

    ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

    Authors: Robert J. Moss, Arec Jamgochian, Johannes Fischer, Anthony Corso, Mykel J. Kochenderfer

    Abstract: To plan safely in uncertain environments, agents must balance utility with safety constraints. Safe planning problems can be modeled as a chance-constrained partially observable Markov decision process (CC-POMDP) and solutions often use expensive rollouts or heuristics to estimate the optimal value and action-selection policy. This work introduces the ConstrainedZero policy iteration algorithm tha…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: In Proceedings of the 2024 International Joint Conference on Artificial Intelligence (IJCAI)

  5. arXiv:2404.06940  [pdf, other]

    cs.RO

    Robotic Learning for Adaptive Informative Path Planning

    Authors: Marija Popovic, Joshua Ott, Julius Rückin, Mykel J. Kochenderfer

    Abstract: Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to…

    Submitted 15 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 22 pages, 1 figure

  6. arXiv:2403.17358  [pdf, other]

    cs.AI

    Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

    Authors: Paula Stocco, Suhas Chundi, Arec Jamgochian, Mykel J. Kochenderfer

    Abstract: Lagrangian-guided Monte Carlo tree search with global dual ascent has been applied to solve large constrained partially observable Markov decision processes (CPOMDPs) online. In this work, we demonstrate that these global dual parameters can lead to myopic action selection during exploration, ultimately leading to suboptimal decision making. To address this, we introduce history-dependent dual var…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to the 2024 International Conference on Automated Planning and Scheduling (ICAPS)

  7. arXiv:2402.09388  [pdf, other]

    cs.AI

    Entropy-regularized Point-based Value Iteration

    Authors: Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

    Abstract: Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty because they rely on an exact model and tend to commit to a single optimal behavior. Inspired by results in the model-free setting, we propose an entropy-regulari…

    Submitted 14 February, 2024; originally announced February 2024.

  8. arXiv:2402.08841  [pdf, other]

    cs.RO

    Approximate Sequential Optimization for Informative Path Planning

    Authors: Joshua Ott, Mykel J. Kochenderfer, Stephen Boyd

    Abstract: We consider the problem of finding an informative path through a graph, given initial and terminal nodes and a given maximum path length. We assume that a linear noise corrupted measurement is taken at each node of an underlying unknown vector that we wish to estimate. The informativeness is measured by the reduction in uncertainty in our estimate, evaluated using several metrics. We present a con…

    Submitted 13 February, 2024; originally announced February 2024.

  9. arXiv:2401.17191  [pdf, other]

    cs.RO

    Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

    Authors: Muhammad Fadhil Ginting, David D. Fan, Sung-Kyun Kim, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

    Abstract: This paper addresses the problem of autonomous robotic inspection in complex and unknown environments. This capability is crucial for efficient and precise inspections in various real-world scenarios, even when faced with perceptual uncertainty and lack of prior knowledge of the environment. Existing methods for real-world autonomous inspections typically rely on predefined targets and waypoints a…

    Submitted 30 January, 2024; originally announced January 2024.

  10. arXiv:2401.15838  [pdf, other]

    stat.ML cs.LG cs.MA math.OC stat.CO

    Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

    Authors: Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

    Abstract: Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literatur…

    Submitted 28 January, 2024; originally announced January 2024.

  11. arXiv:2401.12275  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation

    Authors: Jiachen Li, Chuanbo Hua, Hengbo Ma, Jinkyoo Park, Victoria Dax, Mykel J. Kochenderfer

    Abstract: Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning. While modeling pairwise relations has been widely studied in multi-agent interacting systems, the ability to capture larger-scale group-wise activities is limited. In this paper, we propose a systematic relational reasoning approach with explicit in…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 19 pages, 8 figures, 6 tables

  12. arXiv:2401.10949  [pdf, ps, other]

    cs.MA cs.LG eess.SY

    The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: This paper explores the integration of optimal transport (OT) theory with multi-agent reinforcement learning (MARL). This integration uses OT to handle distributions and transportation problems to enhance the efficiency, coordination, and adaptability of MARL. There are five key areas where OT can impact MARL: (1) policy alignment, where OT's Wasserstein metric is used to align divergent agent str…

    Submitted 24 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  13. arXiv:2401.05610  [pdf, other]

    cs.LG cs.AI

    Graph Q-Learning for Combinatorial Optimization

    Authors: Victoria M. Dax, Jiachen Li, Kevin Leahy, Mykel J. Kochenderfer

    Abstract: Graph-structured data is ubiquitous throughout natural and social sciences, and Graph Neural Networks (GNNs) have recently been shown to be effective at solving prediction and inference problems on graph data. In this paper, we propose and demonstrate that GNNs can be applied to solve Combinatorial Optimization (CO) problems. CO concerns optimizing a function over a discrete solution space that is…

    Submitted 10 January, 2024; originally announced January 2024.

    Journal ref: GLIndA Workshop NeurIPS 2022

  14. Disentangled Neural Relational Inference for Interpretable Motion Prediction

    Authors: Victoria M. Dax, Jiachen Li, Enna Sachdeva, Nakul Agarwal, Mykel J. Kochenderfer

    Abstract: Effective interaction modeling and behavior prediction of dynamic agents play a significant role in interactive motion planning for autonomous robots. Although existing methods have improved prediction accuracy, few research efforts have been devoted to enhancing prediction model interpretability and out-of-distribution (OOD) generalizability. This work addresses these two challenging aspects by d…

    Submitted 7 January, 2024; originally announced January 2024.

    Journal ref: IEEE Robotics and Automation Letters, Date: FEBRUARY 2024, Volume: 9, Issue: 2, ISSN: 2377-3766, pp. 1452-1459

  15. arXiv:2311.16091  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation

    Authors: Jiachen Li, David Isele, Kanghoon Lee, Jinkyoo Park, Kikuo Fujimura, Mykel J. Kochenderfer

    Abstract: Deep reinforcement learning (DRL) provides a promising way for intelligent agents (e.g., autonomous vehicles) to learn to navigate complex scenarios. However, DRL with neural networks as function approximators is typically considered a black box with little explainability and often suffers from suboptimal performance, especially for autonomous navigation in highly interactive multi-agent environme…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 18 pages, 14 figures

  16. arXiv:2311.00192  [pdf, other]

    cs.RO cs.AI

    Large-Scale Multi-Robot Assembly Planning for Autonomous Manufacturing

    Authors: Kyle Brown, Dylan M. Asmar, Mac Schwager, Mykel J. Kochenderfer

    Abstract: Mobile autonomous robots have the potential to revolutionize manufacturing processes. However, employing large robot fleets in manufacturing requires addressing challenges including collision-free movement in a shared workspace, effective multi-robot collaboration to manipulate and transport large payloads, complex task allocation due to coupled manufacturing processes, and spatial planning for pa…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Repository: https://github.com/sisl/ConstructionBots.jl. Under review

  17. arXiv:2310.20054  [pdf, other]

    cs.AI cs.RO

    Constrained Hierarchical Monte Carlo Belief-State Planning

    Authors: Arec Jamgochian, Hugo Buurmeijer, Kyle H. Wray, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Optimal plans in Constrained Partially Observable Markov Decision Processes (CPOMDPs) maximize reward objectives while satisfying hard cost constraints, generalizing safe planning under state and transition uncertainty. Unfortunately, online CPOMDP planning is extremely difficult in large or continuous problem domains. In many large robotic domains, hierarchical decomposition can simplify planning…

    Submitted 26 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  18. arXiv:2310.01723  [pdf, other]

    cs.RO

    Predicting Future Spatiotemporal Occupancy Grids with Semantics for Autonomous Driving

    Authors: Maneekwan Toyungyernsub, Esen Yel, Jiachen Li, Mykel J. Kochenderfer

    Abstract: For autonomous vehicles to proactively plan safe trajectories and make informed decisions, they must be able to predict the future occupancy states of the local environment. However, common issues with occupancy prediction include predictions where moving objects vanish or become blurred, particularly at longer time horizons. We propose an environment prediction framework that incorporates environ…

    Submitted 11 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at the 35th IEEE Intelligent Vehicles Symposium 2024

  19. arXiv:2309.13893  [pdf, other]

    cs.RO cs.AI cs.CV

    Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments

    Authors: Bernard Lange, Jiachen Li, Mykel J. Kochenderfer

    Abstract: Navigating complex and dynamic environments requires autonomous vehicles (AVs) to reason about both visible and occluded regions. This involves predicting the future motion of observed agents, inferring occluded ones, and modeling their interactions based on vectorized scene representations of the partially observable environment. However, prior work on occlusion inference and trajectory predictio…

    Submitted 8 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)

  20. arXiv:2309.12474  [pdf, other]

    cs.RO cs.AI cs.CY cs.ET eess.SY

    SAVME: Efficient Safety Validation for Autonomous Systems Using Meta-Learning

    Authors: Marc R. Schlichting, Nina V. Boord, Anthony L. Corso, Mykel J. Kochenderfer

    Abstract: Discovering potential failures of an autonomous system is important prior to deployment. Falsification-based methods are often used to assess the safety of such systems, but the cost of running many accurate simulations can be high. The validation can be accelerated by identifying critical failure scenarios for the system under test and by reducing the simulation runtime. We propose a Bayesian appr…

    Submitted 30 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted for ITSC 2023

  21. arXiv:2309.08778  [pdf, other]

    cs.LO cs.MS

    Satisfiability.jl: Satisfiability Modulo Theories in Julia

    Authors: Emiko Soroka, Mykel J. Kochenderfer, Sanjay Lall

    Abstract: Satisfiability modulo theories (SMT) is a core tool in formal verification. While the SMT-LIB specification language can be used to interact with theorem proving software, a high-level interface allows for faster and easier specifications of complex SMT formulae. In this paper we present a novel open-source package for interacting with SMT-LIB compliant solvers in the Julia programming language.

    Submitted 15 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 7 pages, revised from a previous longer version to comply with a conference length requirement. Submitted to NASA Formal Methods 2024

    ACM Class: D.2.4

  22. arXiv:2307.10586  [pdf, other]

    cs.LG

    A Holistic Assessment of the Reliability of Machine Learning Systems

    Authors: Anthony Corso, David Karamadian, Romeo Valentin, Mary Cooper, Mykel J. Kochenderfer

    Abstract: As machine learning (ML) systems increasingly permeate high-stakes settings such as healthcare, transportation, military, and national security, concerns regarding their reliability have emerged. Despite notable progress, the performance of these systems can significantly diminish due to adversarial attacks or environmental changes, leading to overconfident predictions, failures to detect input fa…

    Submitted 29 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  23. arXiv:2307.10160  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Robust Driving Policy Learning with Guided Meta Reinforcement Learning

    Authors: Kanghoon Lee, Jiachen Li, David Isele, Jinkyoo Park, Kikuo Fujimura, Mykel J. Kochenderfer

    Abstract: Although deep reinforcement learning (DRL) has shown promising results for autonomous navigation in interactive traffic scenarios, existing work typically adopts a fixed behavior policy to control social vehicles in the training environment. This may cause the learned driving policy to overfit the environment, making it difficult to interact well with vehicles with different, unseen behaviors. In…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: ITSC 2023

  24. arXiv:2307.01371  [pdf, other]

    cs.RO cs.AI

    Efficient Determination of Safety Requirements for Perception Systems

    Authors: Sydney M. Katz, Anthony L. Corso, Esen Yel, Mykel J. Kochenderfer

    Abstract: Perception systems operate as a subcomponent of the general autonomy stack, and perception system designers often need to optimize performance characteristics while maintaining safety with respect to the overall closed-loop system. For this reason, it is useful to distill high-level safety requirements into component-level requirements on the perception system. In this work, we focus on efficientl…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 10 pages, 14 figures, submitted to the 2023 Digital Avionics Systems Conference

  25. arXiv:2306.15605  [pdf, other]

    cs.RO

    Deep Normalizing Flows for State Estimation

    Authors: Harrison Delecki, Liam A. Kruse, Marc R. Schlichting, Mykel J. Kochenderfer

    Abstract: Safe and reliable state estimation techniques are a critical component of next-generation robotic systems. Agents in such systems must be able to reason about the intentions and trajectories of other agents for safe and efficient motion planning. However, classical state estimation techniques such as Gaussian filters often lack the expressive power to represent complex underlying distributions, es…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted to FUSION 2023

  26. arXiv:2306.11203  [pdf, other]

    cs.CV cs.LG

    AVOIDDS: Aircraft Vision-based Intruder Detection Dataset and Simulator

    Authors: Elysia Q. Smyers, Sydney M. Katz, Anthony L. Corso, Mykel J. Kochenderfer

    Abstract: Designing robust machine learning systems remains an open problem, and there is a need for benchmark problems that cover both environmental changes and evaluation on a downstream task. In this work, we introduce AVOIDDS, a realistic object detection benchmark for the vision-based aircraft detect-and-avoid problem. We provide a labeled dataset consisting of 72,000 photorealistic images of intruder…

    Submitted 26 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to and presented at NeurIPS 2023, Datasets and Benchmarks Track; fixed link formatting in the abstract

  27. arXiv:2306.00249  [pdf, other]

    cs.AI

    BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations

    Authors: Robert J. Moss, Anthony Corso, Jef Caers, Mykel J. Kochenderfer

    Abstract: Real-world planning problems, including autonomous driving and sustainable energy applications like carbon storage and resource exploration, have recently been modeled as partially observable Markov decision processes (POMDPs) and solved using approximate methods. To solve high-dimensional POMDPs in practice, state-of-the-art methods use online planning with problem-specific heuristics to reduce p…

    Submitted 16 December, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 16 pages

  28. arXiv:2305.18633  [pdf, other]

    cs.RO

    Experience Filter: Using Past Experiences on Unseen Tasks or Environments

    Authors: Anil Yildiz, Esen Yel, Anthony L. Corso, Kyle H. Wray, Stefan J. Witwicki, Mykel J. Kochenderfer

    Abstract: One of the bottlenecks of training autonomous vehicle (AV) agents is the variability of training environments. Since learning optimal policies for unseen environments is often very costly and requires substantial data collection, it becomes computationally intractable to train the agent on every possible environment or task the AV may encounter. This paper introduces a zero-shot filtering approach…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2023

  29. arXiv:2305.09930  [pdf, other]

    cs.RO cs.LG stat.ML

    Model-based Validation as Probabilistic Inference

    Authors: Harrison Delecki, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Estimating the distribution over failures is a key step in validating autonomous systems. Existing approaches focus on finding failures for a small range of initial conditions or make restrictive assumptions about the properties of the system under test. We frame estimating the distribution over failure trajectories for sequential systems as Bayesian inference. Our model-based approach represents…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

  30. arXiv:2305.02449  [pdf, other]

    cs.LG stat.AP

    Bayesian Safety Validation for Failure Probability Estimation of Black-Box Systems

    Authors: Robert J. Moss, Mykel J. Kochenderfer, Maxime Gariel, Arthur Dubois

    Abstract: Estimating the probability of failure is an important step in the certification of safety-critical systems. Efficient estimation methods are often needed due to the challenges posed by high-dimensional input spaces, risky test scenarios, and computationally expensive simulators. This work frames the problem of black-box safety validation as a Bayesian optimization problem and introduces a method t…

    Submitted 29 June, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: AIAA Journal of Aerospace Information Systems (JAIS) 21.7 (2024): 533-546

  31. arXiv:2304.09352  [pdf, other]

    cs.AI eess.SY physics.flu-dyn

    Optimizing Carbon Storage Operations for Long-Term Safety

    Authors: Yizheng Wang, Markus Zechner, Gege Wen, Anthony Louis Corso, John Michael Mern, Mykel J. Kochenderfer, Jef Karel Caers

    Abstract: To combat global warming and mitigate the risks associated with climate change, carbon capture and storage (CCS) has emerged as a crucial technology. However, safely sequestering CO2 in geological formations for long-term storage presents several challenges. In this study, we address these issues by modeling the decision-making process for carbon storage operations as a partially observable Markov…

    Submitted 18 April, 2023; originally announced April 2023.

  32. arXiv:2304.00645  [pdf, other]

    cs.RO

    Safe and Efficient Navigation in Extreme Environments using Semantic Belief Graphs

    Authors: Muhammad Fadhil Ginting, Sung-Kyun Kim, Oriana Peltzer, Joshua Ott, Sunggoo Jung, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

    Abstract: To achieve autonomy in unknown and unstructured environments, we propose a method for semantic-based planning under perceptual uncertainty. This capability is crucial for safe and efficient robot navigation in environments with mobility-stressing elements that require terrain-specific locomotion policies. We propose the Semantic Belief Graph (SBG), a geometric- and semantic-based representation of…

    Submitted 2 April, 2023; originally announced April 2023.

  33. arXiv:2303.09981  [pdf, other]

    cs.LG

    Inferring Traffic Models in Terminal Airspace from Flight Tracks and Procedures

    Authors: Soyeon Jung, Mykel J. Kochenderfer

    Abstract: Realistic aircraft trajectory models are useful in the design and validation of air traffic management (ATM) systems. Models of aircraft operated under instrument flight rules (IFR) require capturing the variability inherent in how aircraft follow standard flight procedures. The variability in aircraft behavior varies among flight stages. In this paper, we propose a probabilistic model that can le…

    Submitted 30 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  34. arXiv:2303.03530  [pdf, other]

    cs.RO

    Incorporating Human Path Preferences in Robot Navigation with Minimal Interventions

    Authors: Oriana Peltzer, Dylan M. Asmar, Mac Schwager, Mykel J. Kochenderfer

    Abstract: Robots that can effectively understand human intentions from actions are crucial for successful human-robot collaboration. In this work, we address the challenge of a robot navigating towards an unknown goal while also accounting for a human's preference for a particular path in the presence of obstacles. This problem is particularly challenging when both the goal and path preference are unknown a…

    Submitted 15 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  35. arXiv:2302.03164  [pdf, other]

    cs.RO

    Adaptive Coverage Path Planning for Efficient Exploration of Unknown Environments

    Authors: Amanda Bouman, Joshua Ott, Sung-Kyun Kim, Kenny Chen, Mykel J. Kochenderfer, Brett Lopez, Ali-akbar Agha-mohammadi, Joel Burdick

    Abstract: We present a method for solving the coverage problem with the objective of autonomously exploring an unknown environment under mission time constraints. Here, the robot is tasked with planning a path over a horizon such that the accumulated area swept out by its sensor footprint is maximized. Because this problem exhibits a diminishing returns property known as submodularity, we choose to formulat…

    Submitted 6 February, 2023; originally announced February 2023.

  36. arXiv:2301.02362  [pdf, other]

    cs.RO

    Fast and Scalable Signal Inference for Active Robotic Source Seeking

    Authors: Christopher E. Denniston, Oriana Peltzer, Joshua Ott, Sangwoo Moon, Sung-Kyun Kim, Gaurav S. Sukhatme, Mykel J. Kochenderfer, Mac Schwager, Ali-akbar Agha-mohammadi

    Abstract: In active source seeking, a robot takes repeated measurements in order to locate a signal source in a cluttered and unknown environment. A key component of an active source seeking robot planner is a model that can produce estimates of the signal at unknown locations with uncertainty quantification. This model allows the robot to plan for future measurements in the environment. Traditionally, this…

    Submitted 17 April, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 6 pages, Submitted to ICRA 2023 - Contains Appendix

  37. arXiv:2212.14118  [pdf, other]

    eess.SY cs.LG

    Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization

    Authors: Zahra Shahrooei, Mykel J. Kochenderfer, Ali Baheri

    Abstract: Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety speci…

    Submitted 28 April, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 7 pages, 8 figures, Accepted for the 2023 European Control Conference (ECC)

  38. arXiv:2212.12154  [pdf, other]

    cs.AI

    Online Planning for Constrained POMDPs with Continuous Spaces through Dual Ascent

    Authors: Arec Jamgochian, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Rather than augmenting rewards with penalties for undesired behavior, Constrained Partially Observable Markov Decision Processes (CPOMDPs) plan safely by imposing inviolable hard constraint value budgets. Previous work performing online planning for CPOMDPs has only been applied to discrete action and observation spaces. In this work, we propose algorithms for online CPOMDP planning for continuous…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Submitted to ICAPS-23

  39. arXiv:2212.00669  [pdf, other]

    physics.geo-ph cs.AI

    A POMDP Model for Safe Geological Carbon Sequestration

    Authors: Anthony Corso, Yizheng Wang, Markus Zechner, Jef Caers, Mykel J. Kochenderfer

    Abstract: Geological carbon capture and sequestration (CCS), where CO$_2$ is stored in subsurface formations, is a promising and scalable approach for reducing global emissions. However, if done incorrectly, it may lead to earthquakes and leakage of CO$_2$ back to the surface, harming both humans and the environment. These risks are exacerbated by the large amount of uncertainty in the structure of the stor…

    Submitted 24 October, 2022; originally announced December 2022.

    Comments: Accepted at NeurIPS 2022 Workshop on Tackling Climate Change with Machine Learning

  40. arXiv:2211.12470  [pdf, ps, other]

    cs.LG cs.AI

    A Deep Reinforcement Learning Approach to Rare Event Estimation

    Authors: Anthony Corso, Kyu-Young Kim, Shubh Gupta, Grace Gao, Mykel J. Kochenderfer

    Abstract: An important step in the design of autonomous systems is to evaluate the probability that a failure will occur. In safety-critical domains, the failure probability is extremely small so that the evaluation of a policy through Monte Carlo sampling is inefficient. Adaptive importance sampling approaches have been developed for rare event estimation but do not scale well to sequential systems with lo…

    Submitted 22 November, 2022; originally announced November 2022.

  41. arXiv:2211.08701  [pdf, other]

    cs.RO cs.CV cs.LG

    Interpretable Self-Aware Neural Networks for Robust Trajectory Prediction

    Authors: Masha Itkina, Mykel J. Kochenderfer

    Abstract: Although neural networks have seen tremendous success as predictive models in a variety of domains, they can be overly confident in their predictions on out-of-distribution (OOD) data. To be viable for safety-critical applications, like autonomous vehicles, neural networks must accurately estimate their epistemic or model uncertainty, achieving a level of system self-awareness. Techniques for epis…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Conference on Robot Learning (CoRL) 2022, 15 pages, 4 figures

    ACM Class: I.2.9; I.2.6; I.2.10

  42. arXiv:2210.17540  [pdf, other]

    cs.LG cs.MA

    Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning

    Authors: Jennifer She, Jayesh K. Gupta, Mykel J. Kochenderfer

    Abstract: Sparse and delayed rewards pose a challenge to single agent reinforcement learning. This challenge is amplified in multi-agent reinforcement learning (MARL) where credit assignment of these rewards needs to happen not only across time, but also across agents. We propose Agent-Time Attention (ATA), a neural network model with auxiliary losses for redistributing sparse and delayed rewards in collabo…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Full version of the Extended Abstract accepted at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2022

  43. arXiv:2210.08975  [pdf, other]

    cs.CY cs.AI

    Prioritizing emergency evacuations under compounding levels of uncertainty

    Authors: Lisa J. Einstein, Robert J. Moss, Mykel J. Kochenderfer

    Abstract: Well-executed emergency evacuations can save lives and reduce suffering. However, decision makers struggle to determine optimal evacuation policies given the chaos, uncertainty, and value judgments inherent in emergency evacuations. We propose and analyze a decision support tool for pre-crisis training exercises for teams preparing for civilian evacuations and explore the tool in the case of the 2…

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Submitted to the IEEE Global Humanitarian Technology Conference

  44. arXiv:2210.05015  [pdf, other]

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w…

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  45. arXiv:2210.01249  [pdf, other]

    cs.RO cs.CV

    LOPR: Latent Occupancy PRediction using Generative Models

    Authors: Bernard Lange, Masha Itkina, Mykel J. Kochenderfer

    Abstract: Environment prediction frameworks are integral for autonomous vehicles, enabling safe navigation in dynamic environments. LiDAR generated occupancy grid maps (L-OGMs) offer a robust bird's eye-view scene representation that facilitates joint scene predictions without relying on manual labeling unlike commonly used trajectory prediction frameworks. Prior approaches have optimized deterministic L-OG…

    Submitted 24 August, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  46. arXiv:2209.14076  [pdf, other]

    eess.SY cs.LG cs.RO

    Backward Reachability Analysis of Neural Feedback Loops: Techniques for Linear and Nonlinear Systems

    Authors: Nicholas Rober, Sydney M. Katz, Chelsea Sidrane, Esen Yel, Michael Everett, Mykel J. Kochenderfer, Jonathan P. How

    Abstract: As neural networks (NNs) become more prevalent in safety-critical applications such as control of vehicles, there is a growing need to certify that systems with NN components are safe. This paper presents a set of backward reachability approaches for safety certification of neural feedback loops (NFLs), i.e., closed-loop systems with NN control policies. While backward reachability strategies have…

    Submitted 21 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 17 pages, 15 figures. Journal extension of arXiv:2204.08319

  47. arXiv:2209.13172  [pdf, other]

    cs.RO cs.CV

    Dynamics-Aware Spatiotemporal Occupancy Prediction in Urban Environments

    Authors: Maneekwan Toyungyernsub, Esen Yel, Jiachen Li, Mykel J. Kochenderfer

    Abstract: Detection and segmentation of moving obstacles, along with prediction of the future occupancy states of the local environment, are essential for autonomous vehicles to proactively make safe and informed decisions. In this paper, we propose a framework that integrates the two capabilities together using deep neural network architectures. Our method first detects and segments moving objects in the s…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted at 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

    ACM Class: I.2.9; I.2.10

  48. arXiv:2209.13160  [pdf, ps, other]

    cs.AI

    Collaborative Decision Making Using Action Suggestions

    Authors: Dylan M. Asmar, Mykel J. Kochenderfer

    Abstract: The level of autonomy is increasing in systems spanning multiple domains, but these systems still experience failures. One way to mitigate the risk of failures is to integrate human oversight of the autonomous systems and rely on the human to take control when the autonomy fails. In this work, we formulate a method of collaborative decision making through action suggestions that improves action se…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Code is available at https://github.com/sisl/action_suggestions. Accepted to NeurIPS 2022

  49. arXiv:2209.07660  [pdf, other]

    cs.AI cs.RO

    Sequential Bayesian Optimization for Adaptive Informative Path Planning with Multimodal Sensing

    Authors: Joshua Ott, Edward Balaban, Mykel J. Kochenderfer

    Abstract: Adaptive Informative Path Planning with Multimodal Sensing (AIPPMS) considers the problem of an agent equipped with multiple sensors, each with different sensing accuracy and energy costs. The agent's goal is to explore the environment and gather information subject to its resource constraints in unknown, partially observable environments. Previous work has focused on the less general Adaptive Inf…

    Submitted 15 September, 2022; originally announced September 2022.

  50. arXiv:2209.07096  [pdf, other]

    cs.AI

    Multi-Objective Policy Gradients with Topological Constraints

    Authors: Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel

    Abstract: Multi-objective optimization models that encode ordered sequential constraints provide a solution to model various challenging problems including encoding preferences, modeling a curriculum, and enforcing measures of safety. A recently developed theory of topological Markov decision processes (TMDPs) captures this range of problems for the case of discrete states and actions. In this work, we exte…

    Submitted 15 September, 2022; originally announced September 2022.