Skip to main content

Showing 1–19 of 19 results for author: Baheri, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.11774  [pdf, other

    cs.LG eess.SY

    Optimal Transport-Assisted Risk-Sensitive Q-Learning

    Authors: Zahra Shahrooei, Ali Baheri

    Abstract: The primary goal of reinforcement learning is to develop decision-making policies that prioritize optimal performance without considering risk or safety. In contrast, safe reinforcement learning aims to mitigate or avoid unsafe states. This paper presents a risk-sensitive Q-learning algorithm that leverages optimal transport theory to enhance the agent safety. By integrating optimal transport into… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2403.05925  [pdf, other

    eess.SY

    BEACON: A Bayesian Evolutionary Approach for Counterexample Generation of Control Systems

    Authors: Joshua Yancosek, Ali Baheri

    Abstract: The rigorous safety verification of control systems in critical applications is essential, given their increasing complexity and integration into everyday life. Simulation-based falsification approaches play a pivotal role in the safety verification of control systems, particularly within critical applications. These methods systematically explore the operational space of systems to identify confi… ▽ More

    Submitted 12 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  3. arXiv:2402.15893  [pdf, other

    eess.SY cs.LG

    Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning

    Authors: Lunet Yifru, Ali Baheri

    Abstract: Reinforcement learning (RL) has revolutionized decision-making across a wide range of domains over the past few decades. Yet, deploying RL policies in real-world scenarios presents the crucial challenge of ensuring safety. Traditional safe RL approaches have predominantly focused on incorporating predefined safety constraints into the policy learning process. However, this reliance on predefined s… ▽ More

    Submitted 24 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  4. arXiv:2401.10949  [pdf, ps, other

    cs.MA cs.LG eess.SY

    The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: This paper explores the integration of optimal transport (OT) theory with multi-agent reinforcement learning (MARL). This integration uses OT to handle distributions and transportation problems to enhance the efficiency, coordination, and adaptability of MARL. There are five key areas where OT can impact MARL: (1) policy alignment, where OT's Wasserstein metric is used to align divergent agent str… ▽ More

    Submitted 24 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  5. arXiv:2310.12055  [pdf, ps, other

    cs.LG eess.SY math.OC

    Understanding Reward Ambiguity Through Optimal Transport Theory in Inverse Reinforcement Learning

    Authors: Ali Baheri

    Abstract: In inverse reinforcement learning (IRL), the central objective is to infer underlying reward functions from observed expert behaviors in a way that not only explains the given data but also generalizes to unseen scenarios. This ensures robustness against reward ambiguity where multiple reward functions can equally explain the same expert behaviors. While significant efforts have been made in addre… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  6. arXiv:2309.06239  [pdf, ps, other

    cs.LG eess.SY

    Risk-Aware Reinforcement Learning through Optimal Transport Theory

    Authors: Ali Baheri

    Abstract: In the dynamic and uncertain environments where reinforcement learning (RL) operates, risk management becomes a crucial factor in ensuring reliable decision-making. Traditional RL approaches, while effective in reward optimization, often overlook the landscape of potential risks. In response, this paper pioneers the integration of Optimal Transport (OT) theory with RL to create a risk-aware framew… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  7. arXiv:2305.06796  [pdf, ps, other

    cs.LG eess.SY

    Towards Theoretical Understanding of Data-Driven Policy Refinement

    Authors: Ali Baheri

    Abstract: This paper presents an approach for data-driven policy refinement in reinforcement learning, specifically designed for safety-critical applications. Our methodology leverages the strengths of data-driven optimization and reinforcement learning to enhance policy safety and optimality through iterative refinement. Our principal contribution lies in the mathematical formulation of this data-driven po… ▽ More

    Submitted 15 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted at the "Bridging the Gap Between AI Planning and Reinforcement Learning (PRL)" workshop at ICAPS 2023

  8. arXiv:2305.06111  [pdf, ps, other

    eess.SY

    Joint Falsification and Fidelity Settings Optimization for Validation of Safety-Critical Systems: A Theoretical Analysis

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: Safety validation is a crucial component in the development and deployment of autonomous systems, such as self-driving vehicles and robotic systems. Ensuring safe operation necessitates extensive testing and verification of control policies, typically conducted in simulation environments. High-fidelity simulators accurately model real-world dynamics but entail high computational costs, limiting th… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Submitted to the 20th International Conference on Quantitative Evaluation of Systems (QEST 2023)

  9. arXiv:2305.00576  [pdf, ps, other

    eess.SY cs.LG

    Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning

    Authors: Lunet Yifru, Ali Baheri

    Abstract: In many real-world applications, safety constraints for reinforcement learning (RL) algorithms are either unknown or not explicitly defined. We propose a framework that concurrently learns safety constraints and optimal RL policies in such environments, supported by theoretical guarantees. Our approach merges a logically-constrained RL algorithm with an evolutionary algorithm to synthesize signal… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted at the "Bridging the Gap Between AI Planning and Reinforcement Learning (PRL)" workshop at ICAPS 2023

  10. arXiv:2212.14118  [pdf, other

    eess.SY cs.LG

    Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization

    Authors: Zahra Shahrooei, Mykel J. Kochenderfer, Ali Baheri

    Abstract: Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety speci… ▽ More

    Submitted 28 April, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 7 pages, 8 figures, Accepted for the 2023 European Control Conference (ECC)

  11. arXiv:2211.02147  [pdf, other

    eess.SY cs.LG

    A Survey on Reinforcement Learning in Aviation Applications

    Authors: Pouria Razzaghi, Amin Tabrizian, Wei Guo, Shulu Chen, Abenezer Taye, Ellis Thompson, Alexis Bregeon, Ali Baheri, Peng Wei

    Abstract: Compared with model-based control and optimization methods, reinforcement learning (RL) provides a data-driven, learning-based framework to formulate and solve sequential decision-making problems. The RL framework has become promising due to largely improved data availability and computing power in the aviation industry. Many aviation-based applications can be formulated or treated as sequential d… ▽ More

    Submitted 22 November, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

  12. arXiv:2205.04590  [pdf, other

    eess.SY cs.AI cs.LG

    A Verification Framework for Certifying Learning-Based Safety-Critical Aviation Systems

    Authors: Ali Baheri, Hao Ren, Benjamin Johnson, Pouria Razzaghi, Peng Wei

    Abstract: We present a safety verification framework for design-time and run-time assurance of learning-based components in aviation systems. Our proposed framework integrates two novel methodologies. From the design-time assurance perspective, we propose offline mixed-fidelity verification tools that incorporate knowledge from different levels of granularity in simulated environments. From the run-time ass… ▽ More

    Submitted 14 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 12 pages, 9 figures

  13. arXiv:2203.12416  [pdf, other

    cs.RO eess.SY

    A Framework for Controlling Multi-Robot Systems Using Bayesian Optimization and Linear Combination of Vectors

    Authors: Stephen Jacobs, R. Michael Butts, Yu Gu, Ali Baheri, Guilherme A. S. Pereira

    Abstract: We propose a general framework for creating parameterized control schemes for decentralized multi-robot systems. A variety of tasks can be seen in the decentralized multi-robot literature, each with many possible control schemes. For several of them, the agents choose control velocities using algorithms that extract information from the environment and combine that information in meaningful ways.… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 7 pages, 8 figures

  14. arXiv:2203.03451  [pdf, other

    eess.SY

    Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach

    Authors: Jared J. Beard, Ali Baheri

    Abstract: The increasing use of autonomous and semi-autonomous agents in society has made it crucial to validate their safety. However, the complex scenarios in which they are used may make formal verification impossible. To address this challenge, simulation-based safety validation is employed to test the complex system. Recent approaches using reinforcement learning are prone to excessive exploitation of… ▽ More

    Submitted 1 March, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 8 pages, 6 figures, 2 Algorithms, submitted to the 2023 7th IEEE Conference on Control Technology and Applications, Bridgetown, Barbados

  15. arXiv:2007.01698  [pdf, other

    eess.SY

    Safe Reinforcement Learning with Mixture Density Network: A Case Study in Autonomous Highway Driving

    Authors: Ali Baheri

    Abstract: This paper presents a safe reinforcement learning system for automated driving that benefits from multimodal future trajectory predictions. We propose a safety system that consists of two safety components: a heuristic safety and a learning-based safety. The heuristic safety module is based on common driving rules. On the other hand, the learning-based safety module is a data-driven safety rule th… ▽ More

    Submitted 17 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.12905

  16. arXiv:2003.08300  [pdf, other

    eess.SY eess.SP

    Vision-Based Autonomous Driving: A Model Learning Approach

    Authors: Ali Baheri, Ilya Kolmanovsky, Anouck Girard, H. Eric Tseng, Dimitar Filev

    Abstract: We present an integrated approach for perception and control for an autonomous vehicle and demonstrate this approach in a high-fidelity urban driving simulator. Our approach first builds a model for the environment, then trains a policy exploiting the learned model to identify the action to take at each time-step. To build a model for the environment, we leverage several deep learning algorithms.… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: 6

  17. arXiv:1910.12905  [pdf, other

    eess.SY cs.RO

    Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway Driving

    Authors: Ali Baheri, Subramanya Nageshrao, H. Eric Tseng, Ilya Kolmanovsky, Anouck Girard, Dimitar Filev

    Abstract: In this paper, we present a safe deep reinforcement learning system for automated driving. The proposed framework leverages merits of both rule-based and learning-based approaches for safety assurance. Our safety system consists of two modules namely handcrafted safety and dynamically-learned safety. The handcrafted safety module is a heuristic safety rule based on common driving practice that ens… ▽ More

    Submitted 23 April, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

  18. arXiv:1910.12901  [pdf, other

    eess.SY

    Waypoint Optimization Using Bayesian Optimization: A Case Study in Airborne Wind Energy Systems

    Authors: Ali Baheri, Chris Vermillion

    Abstract: We present a data-driven optimization framework that aims to address online adaptation of the flight path shape for an airborne wind energy system (AWE) that follows a repetitive path to generate power. Specifically, Bayesian optimization, which is a data-driven algorithm for finding the optimum of an unknown objective function, is utilized to solve the waypoint adaptation. To form a computational… ▽ More

    Submitted 16 November, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

  19. arXiv:1901.07521  [pdf, other

    eess.SY

    Economically Efficient Combined Plant and Controller Design Using Batch Bayesian Optimization: Mathematical Framework and Airborne Wind Energy Case Study

    Authors: Ali Baheri, Chris Vermillion

    Abstract: We present a novel data-driven nested optimization framework that addresses the problem of coupling between plant and controller optimization. This optimization strategy is tailored towards instances where a closed-form expression for the system dynamic response is unobtainable and simulations or experiments are necessary. Specifically, Bayesian Optimization, which is a data-driven technique for f… ▽ More

    Submitted 22 January, 2019; originally announced January 2019.