Skip to main content

Showing 1–11 of 11 results for author: Nageshrao, S

.
  1. arXiv:2311.10041  [pdf, other

    cs.RO

    Interpretable Reinforcement Learning for Robotics and Continuous Control

    Authors: Rohan Paleja, Letian Chen, Yaru Niu, Andrew Silva, Zhaoxin Li, Songan Zhang, Chace Ritchie, Sugju Choi, Kimberlee Chestnut Chang, Hongtei Eric Tseng, Yan Wang, Subramanya Nageshrao, Matthew Gombolay

    Abstract: Interpretability in machine learning is critical for the safe deployment of learned policies across legally-regulated and safety-critical domains. While gradient-based approaches in reinforcement learning have achieved tremendous success in learning policies for continuous control problems such as robotics and autonomous driving, the lack of interpretability is a fundamental barrier to adoption. W… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.02352

  2. arXiv:2207.07829  [pdf, other

    cs.RO eess.SY

    Robust AI Driving Strategy for Autonomous Vehicles

    Authors: Subramanya Nageshrao, Yousaf Rahman, Vladimir Ivanovic, Mrdjan Jankovic, Eric Tseng, Michael Hafner, Dimitar Filev

    Abstract: There has been significant progress in sensing, perception, and localization for automated driving, However, due to the wide spectrum of traffic/road structure scenarios and the long tail distribution of human driver behavior, it has remained an open challenge for an intelligent vehicle to always know how to make and execute the best decision on road given available sensing / perception / localiza… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

  3. arXiv:2112.03232  [pdf, other

    eess.SY

    A Risk-Averse Preview-based $Q$-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles

    Authors: Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares

    Abstract: A risk-averse preview-based $Q$-learning planner is presented for navigation of autonomous vehicles. To this end, the multi-lane road ahead of a vehicle is represented by a finite-state non-stationary Markov decision process (MDP). A risk assessment unit module is then presented that leverages the preview information provided by sensors along with a stochastic reachability module to assign reward… ▽ More

    Submitted 18 October, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  4. arXiv:2105.05903  [pdf, other

    eess.SY

    Finite-time Koopman Identifier: A Unified Batch-online Learning Framework for Joint Learning of Koopman Structure and Parameters

    Authors: Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares

    Abstract: In this paper, a unified batch-online learning approach is introduced to learn a linear representation of nonlinear system dynamics using the Koopman operator. The presented system modeling approach leverages a novel incremental Koopman-based update law that retrieves a mini-batch of samples stored in a memory to not only minimizes the instantaneous Koopman operator's identification errors but als… ▽ More

    Submitted 26 December, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

  5. arXiv:2103.14606  [pdf, other

    eess.SY

    A Convex Programming Approach to Data-Driven Risk-Averse Reinforcement Learning

    Authors: Yuzhen Han, Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares

    Abstract: This paper presents a model-free reinforcement learning (RL) algorithm to solve the risk-averse optimal control (RAOC) problem for discrete-time nonlinear systems. While successful RL algorithms have been presented to learn optimal control solutions under epistemic uncertainties (i.e., lack of knowledge of system dynamics), they do so by optimizing the expected utility of outcomes, which ignores t… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  6. arXiv:2103.12558  [pdf, other

    cs.AI cs.LG eess.SY

    Assured Learning-enabled Autonomy: A Metacognitive Reinforcement Learning Framework

    Authors: Aquib Mustafa, Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares

    Abstract: Reinforcement learning (RL) agents with pre-specified reward functions cannot provide guaranteed safety across variety of circumstances that an uncertain system might encounter. To guarantee performance while assuring satisfaction of safety constraints across variety of circumstances, an assured autonomous control framework is presented in this paper by empowering RL algorithms with metacognitive… ▽ More

    Submitted 17 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  7. arXiv:2009.09521  [pdf, other

    cs.LG cs.NE eess.SY stat.ML

    Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems

    Authors: Yashesh Dhebar, Kalyanmoy Deb, Subramanya Nageshrao, Ling Zhu, Dimitar Filev

    Abstract: Black-box AI induction methods such as deep reinforcement learning (DRL) are increasingly being used to find optimal policies for a given control task. Although policies represented using a black-box AI are capable of efficiently executing the underlying control task and achieving optimal closed-loop performance, the developed control rules are often complex and neither interpretable nor explainab… ▽ More

    Submitted 6 April, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

    Comments: main paper: 12 pages (pages 1-12), Supplementary Document: 5 pages (from pages 13-17). Video link: https://youtu.be/DByYWTQ6X3E

    Report number: 35737627

    Journal ref: IEEE Transactions on Cybernetics, 23 June 2023

  8. arXiv:2006.08092  [pdf, other

    eess.SY cs.AI cs.RO

    An online evolving framework for advancing reinforcement-learning based automated vehicle control

    Authors: Teawon Han, Subramanya Nageshrao, Dimitar P. Filev, Umit Ozguner

    Abstract: In this paper, an online evolving framework is proposed to detect and revise a controller's imperfect decision-making in advance. The framework consists of three modules: the evolving Finite State Machine (e-FSM), action-reviser, and controller modules. The e-FSM module evolves a stochastic model (e.g., Discrete-Time Markov Chain) from scratch by determining new states and identifying transition p… ▽ More

    Submitted 16 June, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Accepted in IFAC 2020 WC

  9. arXiv:2003.08034  [pdf, other

    eess.SY cs.RO

    Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles

    Authors: Songan Zhang, Huei Peng, Subramanya Nageshrao, H. Eric Tseng

    Abstract: Deep reinforcement learning methods have been widely used in recent years for autonomous vehicle's decision-making. A key issue is that deep neural networks can be fragile to adversarial attacks or other unseen inputs. In this paper, we address the latter issue: we focus on generating socially acceptable perturbations (SAP), so that the autonomous vehicle (AV agent), instead of the challenging veh… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  10. arXiv:1910.12905  [pdf, other

    eess.SY cs.RO

    Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway Driving

    Authors: Ali Baheri, Subramanya Nageshrao, H. Eric Tseng, Ilya Kolmanovsky, Anouck Girard, Dimitar Filev

    Abstract: In this paper, we present a safe deep reinforcement learning system for automated driving. The proposed framework leverages merits of both rule-based and learning-based approaches for safety assurance. Our safety system consists of two modules namely handcrafted safety and dynamically-learned safety. The handcrafted safety module is a heuristic safety rule based on common driving practice that ens… ▽ More

    Submitted 23 April, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

  11. arXiv:1904.00035  [pdf, other

    cs.RO cs.LG eess.SY stat.ML

    Autonomous Highway Driving using Deep Reinforcement Learning

    Authors: Subramanya Nageshrao, Eric Tseng, Dimitar Filev

    Abstract: The operational space of an autonomous vehicle (AV) can be diverse and vary significantly. This may lead to a scenario that was not postulated in the design phase. Due to this, formulating a rule based decision maker for selecting maneuvers may not be ideal. Similarly, it may not be effective to design an a-priori cost function and then solve the optimal control problem in real-time. In order to a… ▽ More

    Submitted 29 March, 2019; originally announced April 2019.